Method for reading non-volatile storage using pre-conditioning waveforms and modified reliability metrics

ABSTRACT

Data stored in non-volatile storage is read using sense operations and associated pre-conditioning waveforms. The pre-conditioning waveform provides a short term history for a non-volatile element which is analogous to the conditions experienced during programming when a programming pulse is applied prior to a verify operation. The pre-conditioning waveform can cause electrons to enter and exit trap sites, for instance, so that the accuracy of a probabilistic decoding process is improved. In one approach, multiple read operations are performed, some with pre-conditioning waveforms and some without. Pre-conditioning waveforms with different characteristics, such as amplitude, shape, duration and time before the associated read pulse, can also be used. For probabilistic decoding, initial reliability metrics can be developed based on multiple reads. Tables which store the reliability metrics can then be prepared for use in subsequent decoding.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is related to commonly assigned: U.S. patentapplication Ser. No. 11/693,668, filed herewith on Mar. 29, 2007 (nowabandoned), titled “Non-Volatile Storage With Reading UsingPre-Conditioning Waveforms and Modified Reliability Metrics”; U.S.patent application Ser. No. 11/693,640, filed herewith on Mar. 29, 2007,published as US2008/0250300 on Nov. 9, 2008, titled “Method for DecodingData in Non-Volatile Storage Using Reliability Metrics Based On MultipleReads”; and U.S. patent application Ser. No. 11/693,672, filed herewithon Mar. 29, 2007 (now abandoned), titled “Non-Volatile Storage WithDecoding of Data Using Reliability Metrics Based On Multiple Reads”,each of which is incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to non-volatile memory.

2. Description of the Related Art

Semiconductor memory has become increasingly popular for use in variouselectronic devices. For example, non-volatile semiconductor memory isused in cellular telephones, digital cameras, personal digitalassistants, mobile computing devices, non-mobile computing devices andother devices. Electrically Erasable Programmable Read Only Memory(EEPROM) and flash memory are among the most popular non-volatilesemiconductor memories. With flash memory, also a type of EEPROM, thecontents of the whole memory array, or of a portion of the memory, canbe erased in one step, in contrast to the traditional, full-featuredEEPROM.

Both the traditional EEPROM and the flash memory utilize a floating gatethat is positioned above and insulated from a channel region in asemiconductor substrate. The floating gate is positioned between thesource and drain regions. A control gate is provided over and insulatedfrom the floating gate. The threshold voltage (V_(TH)) of the transistorthus formed is controlled by the amount of charge that is retained onthe floating gate. That is, the minimum amount of voltage that must beapplied to the control gate before the transistor is turned on to permitconduction between its source and drain is controlled by the level ofcharge on the floating gate.

Some EEPROM and flash memory devices have a floating gate that is usedto store two ranges of charges and, therefore, the memory element can beprogrammed/erased between two states, e.g., an erased state and aprogrammed state. Such a flash memory device is sometimes referred to asa binary flash memory device because each memory element can store onebit of data.

A multi-state (also called multi-level) flash memory device isimplemented by identifying multiple distinct allowed/valid programmedthreshold voltage ranges. Each distinct threshold voltage rangecorresponds to a predetermined value for the set of data bits encoded inthe memory device. For example, each memory element can store two bitsof data when the element can be placed in one of four discrete chargebands corresponding to four distinct threshold voltage ranges.

Typically, a program voltage V_(PGM) applied to the control gate duringa program operation is applied as a series of pulses that increase inmagnitude over time. In one possible approach, the magnitude of thepulses is increased with each successive pulse by a predetermined stepsize, e.g., 0.2-0.4 V. V_(PGM) can be applied to the control gates offlash memory elements. In the periods between the program pulses, verifyoperations are carried out. That is, the programming level of eachelement of a group of elements being programmed in parallel is readbetween successive programming pulses to determine whether it is equalto or greater than a verify level to which the element is beingprogrammed. For arrays of multi-state flash memory elements, averification step may be performed for each state of an element todetermine whether the element has reached its data-associated verifylevel. For example, a multi-state memory element capable of storing datain four states may need to perform verify operations for three comparepoints.

Moreover, when programming an EEPROM or flash memory device, such as aNAND flash memory device in a NAND string, typically V_(PGM) is appliedto the control gate and the bit line is grounded, causing electrons fromthe channel of a cell or memory element, e.g., storage element, to beinjected into the floating gate. When electrons accumulate in thefloating gate, the floating gate becomes negatively charged and thethreshold voltage of the memory element is raised so that the memoryelement is considered to be in a programmed state. More informationabout such programming can be found in U.S. Pat. No. 6,859,397, titled“Source Side Self Boosting Technique For Non-Volatile Memory,” and inU.S. Patent App. Pub. 2005/0024939, titled “Detecting Over ProgrammedMemory,” published Feb. 3, 2005; both of which are incorporated hereinby reference in their entirety.

Once a non-volatile storage element has been programmed, it is importantthat its programming state can be read back with a high degree ofreliability. However, the sensed programming state can sometimes varyfrom the intended programming state due to trap site noise and otherfactors. Perhaps the most important source of noise is 1/f noise(including random telegraph signal noise) which is a result of electrontrapping and de-trapping into trap sites located in the tunnel oxide orelsewhere.

SUMMARY OF THE INVENTION

The present invention addresses the above and other issues by providinga method for reading non-volatile storage using pre-conditioningwaveforms.

In one embodiment, a method for operating non-volatile storage includesapplying at least a first pre-conditioning waveform to at least onenon-volatile storage element in association with a first read operation,performing the first read operation on the at least one non-volatilestorage element, performing a second read operation on the at least onenon-volatile storage element, and determining a programming state of theat least one non-volatile storage element responsive to the first andsecond read operations.

In another embodiment, a method for operating non-volatile storageincludes applying at least a first pre-conditioning waveform to at leastone non-volatile storage element prior to, and in association with, afirst read operation, performing the first read operation on the atleast one non-volatile storage element, applying at least a secondpre-conditioning waveform to the at least one non-volatile storageelement prior to, and in association with, a second read operation,performing the second read operation on the at least one non-volatilestorage element, and determining a programming state of the at least onenon-volatile storage element responsive to the first and second readoperations.

In another embodiment, a method for operating non-volatile storageincludes performing a plurality of sense operations on a set ofnon-volatile storage elements for each of a plurality of programmingstates, where at least one pre-conditioning waveform is provided for atleast a first of the sense operations, providing a set of reliabilitymetrics based on the sense operations, and storing the set ofreliability metrics for use by an iterative probabilistic decodingprocess in determining a programming state of at least one non-volatilestorage element in the set of non-volatile storage elements in at leastone subsequent sense operation.

In another embodiment, a method for operating non-volatile storageincludes applying at least a first pre-conditioning waveform to at leastone non-volatile storage element in association with a first readoperation, performing the first read operation on the at least onenon-volatile storage element, and determining a programming state of theat least one non-volatile storage element responsive to the first readoperation using iterative probabilistic decoding.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a top view of a NAND string.

FIG. 2 is an equivalent circuit diagram of the NAND string of FIG. 1.

FIG. 3 is a block diagram of an array of NAND flash storage elements.

FIG. 4 depicts a system for encoding and decoding of data fornon-volatile storage.

FIG. 5 a is a flowchart of a process for obtaining a first probabilitydensity function f1 for a set of non-volatile storage elements.

FIG. 5 b is a flowchart of a process for obtaining a second probabilitydensity function f2 for a set of non-volatile storage elements.

FIG. 6 a depicts a distribution of voltage threshold readings.

FIG. 6 b depicts noise-free voltage threshold readings.

FIG. 6 c depicts a distribution of noise-free voltage thresholdreadings.

FIG. 6 d depicts probability distributions of voltage threshold fordifferent states of a set of non-volatile storage elements.

FIG. 7 a depicts a distribution of voltage threshold deviations for anexample state of a non-volatile storage element.

FIG. 7 b depicts a distribution of voltage threshold deviations for anexample state of a set of non-volatile storage elements.

FIG. 7 c depicts a probability distribution of voltage thresholddeviations for State 0 for a set of non-volatile storage elements.

FIG. 7 d depicts a probability distribution of voltage thresholddeviations for State 1 for a set of non-volatile storage elements.

FIG. 7 e depicts a probability distribution of voltage thresholddeviations for State 15 for a set of non-volatile storage elements.

FIG. 8 is a flowchart of a process for obtaining logarithmic likelihoodratios (LLRs) for use in decoding read data from a non-volatile storageelement.

FIG. 9 a depicts a table which provides multi-bit code words fordifferent programmed states of a non-volatile storage element.

FIG. 9 b depicts a table which provides initial values of LLRs for eachbit of a code word based on a first read result.

FIG. 9 c depicts a table which provides adjustments to current values ofLLRs used by a decoder for each bit of a code word based on a secondread result.

FIG. 10 a-d depict tables which provide initial values of LLRs for eachbit of a code word based on first and second read results.

FIG. 11 a is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on first and second readoperations.

FIG. 11 b is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on a first read operation, thenadjusted probability metrics are adjusted further based on a second readoperation.

FIG. 11 c is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on a first read operation, thennew initial probability metrics are obtained based on the first readoperation and a second read operation.

FIG. 12 depicts a sparse parity check matrix.

FIG. 13 depicts a sparse bipartite graph which corresponds to the sparseparity check matrix of FIG. 12.

FIG. 14 a is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a selected word line before an associated readpulse.

FIG. 14 b is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where one or morepre-conditioning waveforms are applied to a selected word line beforeassociated read pulses.

FIG. 14 c is a timing diagram that depicts different pre-conditioningwaveforms.

FIG. 14 d is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a drain of a selected storage element via aselected bit line before an associated read pulse.

FIG. 14 e is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a source of a selected storage element via asource line before an associated read pulse.

FIG. 14 f is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a selected storage element via a body bias beforean associated read pulse.

FIG. 14 g is a flowchart of a process for performing a read operation ona storage element, where pre-conditioning waveforms are applied to astorage element before associated read pulses.

FIG. 14 h is a flowchart of a process for performing a read operation ona storage element, where a pre-conditioning waveform is applied to astorage element before a series of read pulses.

FIG. 14 i is a flowchart of a process for obtaining reliability metricsusing pre-conditioning waveforms for subsequent use in decoding.

FIG. 15 is a block diagram of an array of NAND flash storage elements.

FIG. 16 is a block diagram of a non-volatile memory system using singlerow/column decoders and read/write circuits.

FIG. 17 is a block diagram of a non-volatile memory system using dualrow/column decoders and read/write circuits.

FIG. 18 is a block diagram depicting one embodiment of a sense block.

FIG. 19 illustrates an example of an organization of a memory array intoblocks for an all bit line memory architecture or for an odd-even memoryarchitecture.

FIG. 20 depicts an example set of threshold voltage distributions.

FIG. 21 depicts an example set of threshold voltage distributions.

FIGS. 22 a-c show various threshold voltage distributions and describe aprocess for programming non-volatile memory.

FIG. 23 is a flow chart describing one embodiment of a process forprogramming non-volatile memory.

FIG. 24 depicts an example pulse train applied to the control gates ofnon-volatile storage elements during programming.

DETAILED DESCRIPTION

The present invention provides a method for reading non-volatile storageusing pre-conditioning waveforms.

One example of a memory system suitable for implementing the presentinvention uses the NAND flash memory structure, which includes arrangingmultiple transistors in series between two select gates. The transistorsin series and the select gates are referred to as a NAND string. FIG. 1is a top view showing one NAND string. FIG. 2 is an equivalent circuitthereof. The NAND string depicted in FIGS. 1 and 2 includes fourtransistors, 100, 102, 104 and 106, in series and sandwiched between afirst select gate 120 and a second select gate 122. Select gate 120gates the NAND string connection to bit line 126. Select gate 122 gatesthe NAND string connection to source line 128. Select gate 120 iscontrolled by applying the appropriate voltages to control gate 120CG.Select gate 122 is controlled by applying the appropriate voltages tocontrol gate 122CG. Each of the transistors 100, 102, 104 and 106 has acontrol gate and a floating gate. Transistor 100 has control gate 100CGand floating gate 100FG. Transistor 102 includes control gate 102CG andfloating gate 102FG. Transistor 104 includes control gate 104CG andfloating gate 104FG. Transistor 106 includes a control gate 106CG andfloating gate 106FG. Control gate 100CG is connected to word line WL3,control gate 102CG is connected to word line WL2, control gate 104CG isconnected to word line WL1, and control gate 106CG is connected to wordline WL0. The control gates can also be provided as portions of the wordlines. In one embodiment, transistors 100, 102, 104 and 106 are eachstorage elements, also referred to as memory cells. In otherembodiments, the storage elements may include multiple transistors ormay be different than that depicted in FIGS. 1 and 2. Select gate 120 isconnected to select line SGD (drain select gate). Select gate 122 isconnected to select line SGS (source select gate).

FIG. 3 is a circuit diagram depicting three NAND strings. A typicalarchitecture for a flash memory system using a NAND structure willinclude several NAND strings. For example, three NAND strings 320, 340and 360 are shown in a memory array having many more NAND strings. Eachof the NAND strings includes two select gates and four storage elements.While four storage elements are illustrated for simplicity, modern NANDstrings can have up to thirty-two or sixty-four storage elements, forinstance.

For example, NAND string 320 includes select gates 322 and 327, andstorage elements 323-326, NAND string 340 includes select gates 342 and347, and storage elements 343-346, NAND string 360 includes select gates362 and 367, and storage elements 363-366. Each NAND string is connectedto the source line by its select gates (e.g., select gates 327, 347 or367). A selection line SGS is used to control the source side selectgates. The various NAND strings 320, 340 and 360 are connected torespective bit lines 321, 341 and 361, by select transistors in theselect gates 322, 342, 362, etc. These select transistors are controlledby a drain select line SGD. In other embodiments, the select lines donot necessarily need to be in common among the NAND strings; that is,different select lines can be provided for different NAND strings. Wordline WL3 is connected to the control gates for storage elements 323, 343and 363. Word line WL2 is connected to the control gates for storageelements 324, 344 and 364. Word line WL1 is connected to the controlgates for storage elements 325, 345 and 365. Word line WL0 is connectedto the control gates for storage elements 326, 346 and 366. As can beseen, each bit line and the respective NAND string comprise the columnsof the array or set of storage elements. The word lines (WL3, WL2, WL1and WL0) comprise the rows of the array or set. Each word line connectsthe control gates of each storage element in the row. Or, the controlgates may be provided by the word lines themselves. For example, wordline WL2 provides the control gates for storage elements 324, 344 and364. In practice, there can be thousands of storage elements on a wordline.

Each storage element can store data. For example, when storing one bitof digital data, the range of possible threshold voltages (V_(TH)) ofthe storage element is divided into two ranges which are assignedlogical data “1” and “0.” In one example of a NAND type flash memory,the V_(TH) is negative after the storage element is erased, and definedas logic “1.” The V_(TH) after a program operation is positive anddefined as logic “0.” When the V_(TH) is negative and a read isattempted, the storage element will turn on to indicate logic “1” isbeing stored. When the V_(TH) is positive and a read operation isattempted, the storage element will not turn on, which indicates thatlogic “0” is stored. A storage element can also store multiple levels ofinformation, for example, multiple bits of digital data. In this case,the range of V_(TH) value is divided into the number of levels of data.For example, if four levels of information are stored, there will befour V_(TH) ranges assigned to the data values “11”, “10”, “01”, and“00.” In one example of a NAND type memory, the V_(TH) after an eraseoperation is negative and defined as “11”. Positive V_(TH) values areused for the states of “10”, “01”, and “00.” The specific relationshipbetween the data programmed into the storage element and the thresholdvoltage ranges of the element depends upon the data encoding schemeadopted for the storage elements. For example, U.S. Pat. No. 6,222,762and U.S. Patent Application Pub. 2004/0255090, both of which areincorporated herein by reference in their entirety, describe variousdata encoding schemes for multi-state flash storage elements.

Relevant examples of NAND type flash memories and their operation areprovided in U.S. Pat. Nos. 5,386,422, 5,522,580, 5,570,315, 5,774,397,6,046,935, 6,456,528 and 6,522,580, each of which is incorporated hereinby reference.

When programming a flash storage element, a program voltage is appliedto the control gate of the storage element and the bit line associatedwith the storage element is grounded. Electrons from the channel areinjected into the floating gate. When electrons accumulate in thefloating gate, the floating gate becomes negatively charged and theV_(TH) of the storage element is raised. To apply the program voltage tothe control gate of the storage element being programmed, that programvoltage is applied on the appropriate word line. As discussed above, onestorage element in each of the NAND strings share the same word line.For example, when programming storage element 324 of FIG. 3, the programvoltage will also be applied to the control gates of storage elements344 and 364.

FIG. 4 depicts a system for encoding and decoding of data fornon-volatile storage. Data which is stored in non-volatile storage canbe encoded and decoded in a way which mitigates the effects of noise.Perhaps the most important source of noise is the 1/f noise (includingrandom telegraph signal noise) which is a result of electrons trappingand de-trapping into trap sites located in the tunnel oxide orelsewhere. The noise is not so much a result of the loss of the channelelectron going into a trap site as it is due to the fact that theelectron/hole in the charged trap site affects the flow of otherelectrons in the channel by the electric field that the charged trapsite exerts on a region of the channel in its vicinity. Moreover, theregion of the channel that is under the influence of a single trap sitewill form a larger portion of the channel as the storage elements arescaled down.

Many noisy storage elements suffer from a single trap site, and thisconclusion is based on the binary nature of their current values, thatis, a storage element has one current/V_(TH) value if the trap isoccupied and another distinct current/V_(TH) value if the trap isunoccupied. Thus, when storage elements are biased under DC conditionsequivalent to the read condition, many storage elements exhibit abimodal distribution of current values with two narrow distributionshaving peaks that are substantially separated from each other. However,some storage elements suffer from more than a single noisy trap site.Moreover, not every trap site can lead to noisy behavior, as there mayexist trap sites that are consistently empty or consistently occupiedduring read conditions. Also, trap sites which are in easy communicationwith some electrode, and as a result make a large number of transitionsbetween being empty and being occupied during any single integrationtime (e.g., read period), will manifest little or no noise as theiraverage effect is more or less the same for any integrationtime/measurement operation. This can be explained by the averagingconcept or, more precisely, by the Central Limit Theorem. Also, theoccupation probability of trap sites can be modulated by the electricfield that the trap sites find themselves immersed in. Further, thosetrap sites that are more detrimental to read operations are those withlonger occupation/inoccupation life times. Such trap sites can bethought of as parasitic memory devices that interfere with the normaloperations of the memory. Write/erase cycles can and do createadditional trap sites, and lead to more noise.

As a result, read operations can be impacted by noise in a storageelement. Although error correction coding and decoding schemes canaddress some errors cause by noise and other factors, additionaladvantages can be achieved by performing multiple read operations asexplained herein. An example approach is depicted in theencoding/decoding system of FIG. 4, which includes an encoder 402,non-volatile storage 404, LLR (log likelihood ratio) tables 406 and adecoder 408. The encoder 402 receives information bits, also referred toas user data, which is to be stored in the non-volatile storage 404. Theinformation bits are represented by the matrix i=[1 0]. The encoder 402implements an error correction coding process in which parity bits areadded to the information bits to provide data represented by the matrixor code word v=[1 0 1 0], indicating that two parity bits have beenappended to the data bits. This is a simplified example which results ina high parity bit overhead cost. In practice, codes with lower overheadcan be used. For example, low density parity check (LDPC) codes, alsoreferred to as Gallager codes, may be used. Such codes are typicallyapplied to multiple code words which are encoded across a number ofstorage elements so that the parity bits are distributed among thestorage elements. Further information regarding LDPCs can be found in D.MacKay, Information Theory, Inference and Learning Algorithms, CambridgeUniversity Press 2003, chapter 47. The data bits can be mapped to alogical page and stored in the non-volatile storage 404 by programming anon-volatile storage element to a programming state, e.g., X=6, whichcorresponds to v (see FIG. 9 a). With a four-bit data matrix v, sixteenprogramming states can be used.

Subsequently, when it is desired to retrieve the stored data, thenon-volatile storage is read. However, due to noise, as mentioned, theread state can sometimes be errored. In one example approach, a firstread Y1 yields the programming state 7 which is represented by the codeword y1=[1 0 1 1], and a second read Y2 yields the programming state 6which is represented by the code word y2=[1 0 1 0]. In one possibleimplementation, an iterative probabilistic decoding process is usedwhich implements error correction decoding corresponding to the errorcorrection encoding at the encoder 402. Further details regardingiterative probabilistic decoding can be found in the above-mentioned D.MacKay text. The iterative probabilistic decoding attempts to decode acode word by assigning initial probability metrics to each bit in thecode word. The probability metrics indicate a reliability of each bit,that is, how likely it is that the bit is not errored. In one approach,the probability metrics are logarithmic likelihood ratios (LLRs) whichare obtained from LLR tables 406. LLR values are measures of thereliability with which we know the values of various binary bits readfrom storage elements.

The LLR for a bit is given by

${Q = {\log_{2}\frac{P\left( {v = {0\text{❘}Y}} \right)}{P\left( {v = {1\text{❘}Y}} \right)}}},$where P(v=0|Y) is the probability that a bit is a 0 given the conditionthat the read state is Y, and P(v=1|Y) is the probability that a bit isa 1 given the condition that the read state is Y. Thus, an LLR>0indicates a bit is more likely a 0 than a 1, while an LLR<0 indicates abit is more likely a 1 than a 0, based on one or more parity checks ofthe error correction code. Further, a greater magnitude indicates agreater probability or reliability. Thus, a bit with an LLR=20 is morelikely to be a 0 than a bit with an LLR=10, and a bit with an LLR=−20 ismore likely to be a 1 than a bit with an LLR=−10. LLR=0 indicates thebit is equally likely to be a 0 or a 1.

An LLR table 406 can be provided for each of the four bit positions inthe codeword Y1 so that an LLR is assigned to each bit 0, 0, 1 and 0,respectively, of y1. Further, the LLR tables can account for themultiple read results so that an LLR of greater magnitude is used whenthe bit value is consistent in the different code words. Thus, an LLRwith a greater magnitude can be assigned to the first bit in y1 than ifonly one read operation was performed. To illustrate, the first bit iny2 is 1, which is consistent with the first bit in y1. Likewise, an LLRof lesser magnitude is used when the bit value is inconsistent in thedifferent code words. For example, the fourth bit in y2 is 0, which isinconsistent with the fourth bit in y1. Thus, an LLR with a lessermagnitude can be assigned to the fourth bit in y1 than if only one readoperation was performed. Since more information is obtained from theadditional read operations, the decoding process can be improved, e.g.,so that it converges more quickly or converges in cases in which itwould otherwise not converge if only one read operation was made. Inanother approach, the second read operation, or other additional readoperations, are not performed unless the decoding process does notsuccessfully converge, e.g., within a given amount of time or number ofiterations.

The decoder 408 receives the code word y1 and the initial LLRs. Asexplained additionally further below (see also FIGS. 12 and 13), thedecoder 408 iterates in successive iterations in which it determines ifparity checks of the error encoding process have been satisfied. If allparity checks are satisfied initially, the decoding process hasconverged and the code word is not errored. If one or more parity checkshave not been satisfied, the decoder will perform error correction byadjusting the LLRs of one or more of the bits which are inconsistentwith a parity check and then reapply the parity check to determine if ithas been satisfied. For example, the magnitude and/or polarity of theLLRs can be adjusted. If the parity check in question is still notsatisfied, the LLR can be adjusted again in another iteration. Adjustingthe LLRs can result in flipping a bit (e.g., from 0 to 1 or from 1 to 0)in some, but not all, cases. Once the parity check in question has beensatisfied, the next parity check, if applicable, is applied to the codeword. The process continues in an attempt to satisfy all parity checks.Thus, the decoding process of y1 is completed to obtain the decodedinformation and parity bits v and the decoded information bits i.

Note that, in the example discussed, the code word y1 from the firstread operation is decoded with assistance from one or more subsequentread operations. However, other approaches are possible. For example,the code word from a given read operation can be decoded with assistancefrom one or more prior read operations. Or, three read operations can betaken which determine two consistent results (e.g., both read state Y1)and one inconsistent result (e.g., state Y2), and the code word for theconsistent result can be decoded with assistance from the inconsistentresult. The LLR tables can be set accordingly.

In another option, the code word for the consistent result is decodeddirectly without assistance from the inconsistent result.

The examples discussed therefore include performing two or more reads inorder to mitigate the effect of noise, and combining the results ofthese reads to modify the LLR numbers pertaining to the state of eachstorage element. When the iteration process of the ECC decoder takes toolong to reach convergence, another read operation can be performed. Thedecoding can continue or can be paused while the additional read isperformed. Or, the additional read can be performed automaticallyregardless of the progress of the decoding. After the second readoperation is complete, the LLR signed magnitudes from the first readoperation can be updated based on the second read operation. In oneapproach, LLR values be added or otherwise combined from different readoperations. For instance, consider a noisy storage element whose LLRvalues are 10 and −10 for a given bit for the first and second readoperations. Such a table receives the read state as an input and outputsan LLR value for each bit in the code word which represents the readstate. After these two contradictory results are obtained, an LLR of 0,for instance, may be used for the decoding. The decoding engine will setthis bit high or low in order to attain convergence.

Performing more than two read operations is also possible. One approachis to add or to take the average or mean of LLR results from all reads.For example, LLR values may be 20, 10 and −10 for a given bit for thefirst, second and third read operations, respectively, in which case theaverage is 6.6. This is a simplified approach which is expedient. Otherapproaches can take into account the fact that an LLR of 20 indicatesmore that twice the probability of a given result than an LLR of 10 sothat, e.g., LLRs of 10 and 20 are combined to an LLR closer to 20 than10.

Also, by the time the results of the second read operation have becomeavailable, the iteration process may have advanced, and many LLR valuesmay have been updated in the attempt to converge. The results of thesecond read can therefore be combined with the real time iteratedresults of the first read that are currently available or with originalresults of the first read, or a hybrid approach can be used.

Further, if every one of the read operations yields the same result, fora given bit, we can assign a higher magnitude LLR for the bit. This peakLLR (LLRpeak) could be slightly higher than the LLR that would be gainedfrom only a single read (LLRsingle). If the series of reads yieldsdiffering results, then a lower final LLR could result. One method ofaggregating a series of reads is to take the average of the single-readLLRs and multiplying it by a normalization factor such asLLRpeak/LLRsingle, so that Final LLR=Averaged LLR of multiplereads×|LLRpeak/LLRsingle|

In an actual implementation, iteration for convergence based on thefirst read result can begin before the second read results are obtained.Once the second read result and, in general, all subsequent read resultsare obtained, one strategy for incorporating the subsequent read resultsinto the iteration process is to presume that all the raw results asread from the storage elements are still available. In this case, we maysimply interrupt the current iteration and begin anew with Final LLR ascalculated above. Another strategy for aggregating LLR values is tocombine newly acquired read results with the current iterated LLRvalues. This could be done, e.g., by weighting the nth read result by1/(n−1) relative to the previous results, so that all read results areweighted equally.

Which strategy is more appropriate could be determined by how well thecurrent iteration is proceeding. The “goodness” of the iteration can bemeasured by a quality metric, such as whether or not the iterativedecoding process approaches convergence within a given time periodand/or a given number of iterations, and/or a specific number of paritychecks have been satisfied. Further, the metric can be used in areal-time adaptive decision process during the decoding, e.g., bychoosing to continue the current decoding process, without performing anadditional read, continue the decoding while adjusting the most recentlyused LLRs based on an additional read, or restarting the decoding withnew initial LLRs based on an additional read.

LLRs or other probabilistic metrics can be developed, e.g., by obtainingprobability density functions of the programmed states of a set ofnon-volatile storage elements, as follows.

FIG. 5 a is a flowchart of a process for obtaining a first probabilitydensity function f1 for a set of non-volatile storage elements. Step 500includes programming random data to all storage elements in a set of Mstorage elements. That is, assuming the storage elements are multi-levelstorage elements with n levels or programming states, after theprogramming, about M/n of the storage elements are programmed to a firststate, M/n are programmed to a second state, and so forth. For instance,n=16 states in one possible approach, where each state is represented bya four-bit code word. At step 502, index k, which represents a kthprogramming state, is initialized to zero. Step 506 includes making Nrepeated measurements of the V_(TH) for each storage element written tostate k, where N is a large number such as one hundred. These aremeasurements which are made in a test environment to a higher degree ofaccuracy than is used during production use of the storage device.

Step 508 includes obtaining a noise-free V_(TH) for each storageelements by averaging the N measurements for each storage element. Thisresults in about M/n noise-free V_(TH) values over the set of storageelements. For example, FIG. 6 a depicts a distribution of voltagethreshold readings. For state 0, for instance, the histogram 600 showsthe number of readings which fell into different ranges or buckets ofV_(TH) for an example first storage element Histogram 602 is for state 1for an example second storage element and histogram 604 is for state 15for an example third storage element. Histograms for the intermediatestates are not depicted. FIG. 6 b depicts noise-free voltage thresholdreadings. For example, V_(TH-NF0), V_(TH-NF1), . . . , V_(TH-NF15)represent the noise-free voltage threshold readings for states 0, 1, . .. , 15, respectively, for the example first, second and third storageelement. Note that the raw V_(TH) readings can be averaged as wellinstead of using the histograms of FIG. 6 a. Further, statisticaltechniques other than averaging can be used.

Step 510 (FIG. 5 a) includes constructing a histogram of thedistribution of the noise-free V_(TH) across all storage elements in theset which were programmed to state k, using the noise-free V_(TH) fromeach storage element. For example, FIG. 6 c depicts a distribution ofnoise-free voltage threshold readings, e.g., readings which hare freefrom read noise. Other noise may still be present. For state 0 for theexample first storage element, for instance, the histogram 650 shows thenumber of readings which fell into different ranges or buckets ofV_(TH). Histogram 652 is for state 1 for the example second storageelement and histogram 654 is for state 15 for the example third storageelement. Histograms for the intermediate states are not depicted. Thesehistograms are bar charts indicating how many of the M storage elementcells have a V_(TH) in the designated ranges. Step 512 includesnormalizing and curve fitting the histogram to obtain f1(u|X=state k), aprobability density function (pdf) indicating the probability that astorage element will have a V_(TH)=u, when the storage element wasprogrammed to state k. For example, with k=0, the function f1(u|X=0) isthe probability that if we randomly selected a storage element from thememory chip, where the storage element had been programmed (written) tostate 0, its noise-free V_(TH) will be u. Normalizing the histogram(e.g., one of the histograms 650, 650, . . . , 654) can includingdividing the height of the bar chart by M so that the sum of the heightsof every bar for a given state is one.

At step 514, the results are stored, e.g., including data defining thefunction f1(u|X=state k). If a next state is to be analyzed at decisionstep 516, the state index is incremented at step 504 and processingproceeds again at steps 506-514. This processing is repeated for allmemory states. The process ends at step 518 at which time the pdfsf1(u|X=0), f1(u|X=1), f1(u|X=2), f1(u|X=3), . . . f1(u|X=n) have beenobtained, where n+1 is the number of states. FIG. 6 d depictsprobability distributions of voltage threshold for different states of aset of non-volatile storage elements. Here, example distributions 660for f1(u|X=0), 662 for f1(u|X=0), . . . and 664 for f1(u|X=15) aredepicted, where there are sixteen states. Distributions for theintermediate states are not depicted.

FIG. 5 b is a flowchart of a process for obtaining a second probabilitydensity function f2 for a set of non-volatile storage elements. Steps520, 522 and 526 correspond to steps 500, 502 and 506 of FIG. 5 a.Further, these steps can be repeated relative to the corresponding stepsof FIG. 5 a, or the results from steps 500, 502 and 506 of FIG. 5 a canbe used starting at step 528 of FIG. 5 b. Step 528 includes subtractingthe noise-free V_(TH) from each measurement to obtain shiftedmeasurements. For example, FIG. 7 a depicts a histogram of such shiftedmeasurements or deviations from the noise-free V_(TH). Step 530 includesconstructing a histogram of the distribution of the shifted measurementsacross all storage elements which were programmed to state k. FIG. 7 bdepicts an example of such a histogram. Step 532 includes normalizingand curve fitting the histogram to obtain a function f2(v), which is thepdf indicating the probability that a storage element will have a readV_(TH) which deviates from the noise-free V_(TH) by v when the storageelement was programmed to state k. The function f2(v) is the probabilitythat if we randomly selected a storage element from the memory chip andmade a single measurement, the resultant V_(TH) is a distance “v” awayfrom the “true” or “noise-less” V_(TH). FIGS. 7 c-7 e depicts examplesof f2(v) for different states. Specifically, FIG. 7 c depicts aprobability distribution of voltage threshold deviations for State 0,FIG. 7 d depicts a probability distribution of voltage thresholddeviations for State 1, and FIG. 7 e depicts a probability distributionof voltage threshold deviations for State 15. Note that the figuresindicate that it is possible for the distributions to differ somewhat.The distributions of f1 can similarly differ.

At step 534, the results are stored, e.g., including data defining thefunction f2(v). If a next state is to be analyzed at decision step 536,the state index is incremented at step 524 and processing proceeds againat steps 526-534. This processing is repeated for all memory states. Theprocess ends at step 538, at which time the pdf f2(v) has been obtainedfor all states.

Note that the pdfs can vary with the age/cycling of the memory device asthe noise free V_(TH) distribution can change. The measurements ofV_(TH) which are used to obtain the pdfs can be performed at differentdevice ages and averaged, for instance, to obtain pdfs, and resultingLLRs, which are representative of an average device age.

FIG. 8 is a flowchart of a process for obtaining logarithmic likelihoodratios (LLRs) for use in decoding read data from a non-volatile storageelement. At step 800, based on f1(u) and f2(v), we calculate theconditional probability P(Y₁, Y₂|X). This is the probability that, giventhat state X as written to a storage element, the first read yields Y₁,the second read yields Y₂, and so forth. The probability can beexpressed in the form P(Y₁,Y₂|X) when there are two read operations, orgenerally P(Y₁,Y₂, . . . , Y_(N)|X) when there are N read operations.The probability can be determined for measurements in a testenvironment. At step 810, based on these probabilities and Bayes rule,we calculate P(X|Y1,Y2) when there are two read operations, or generallyP(X|Y1,Y2,Y3,Y4 . . . Y_(N)) when there are N read operations.P(X|Y1,Y2) is the probability that, give a first read result Y1 and asecond read result Y2, the programmed state is X. Step 820 includescalculating an LLR for each bit given all possible y values for N reads,and step 830 includes storing the results, e.g., in one or more tables.That is, an LLR is assigned to each bit in each code word used torepresent a programmed state. During operation of the memory device, thetables that are provided can be used to find initial LLR numbers fordecoding, given the results of the multiple read operations, in onepossible approach.

Note that the technique outlined here assumes independence betweenread-noise and the “noise-less” V_(TH). The technique can be extended toencompass cases where the read-noise values and the “noise-less” V_(TH)are not independent. This essentially involves constructing a joint pdff(u, v₁, v₂, . . . v_(n)) from experimental data.

FIG. 9 a depicts a table which provides multi-bit code words fordifferent programmed states of a non-volatile storage element. Asmentioned previously, each programming state of a storage element can berepresented by a code word. For example, with sixteen states, a four bitcode word can be used. Further, an LLR or other reliability metric isassociated with each bit indicating the probability that the bit is noterrored (a higher magnitude LLR indicates a higher probability that thebit is not errored). FIG. 9 a depicts bit values or code words incolumns beneath the programmed states 0 through 15. The bits positionsare depicted as top, higher, upper and lower. The lower bit is the mostsignificant bit and the top bit is the least significant bit. Thus, thecodeword for state 0 is 1111, the code word for state 1 is 1110 and soforth. An LLR is associated with each bit as indicated in FIG. 9 b.

FIG. 9 b depicts a table which provides initial values of LLRs for eachbit of a code word based on a first read result. The LLRs are denoted byvalues M1, M2 and M3, where M1<M2<M3. As mentioned previously, apositive LLR indicates a 0 bit, a negative LLR indicates a 1 bit, and agreater magnitude indicates a greater reliability or probability ofcorrectness. For example, for the lower bits in states 0 through 5, theLLR=−M3, indicating these bits have a high probability of being a 1.This can be seen intuitively, since the probability that the read stateY1 is far away from the programmed state, e.g., several states away, issmall. Thus, the LLR for the lower bit for state M1 is −M3 (higherprobability of correctness) since the read state would have to be off bythree states from the programmed state, e.g., state 8 (where the lowerbit is 0, not 1). However, the LLR for the lower bit for state 6 is −M2(intermediate probability of correctness) since the read state wouldhave to be off by two states for the bit to be errored. Similarly, theLLR for the lower bit for state 7 is −M1 (lower probability ofcorrectness) since the read state would have to be off by only one statefor the bit to be errored. Similar reasoning applies to the other bitpositions. For example, the LLRs for the top bits indicate a relativelylow probability of correctness since an error of only one state wouldresult in the bit being incorrect.

FIG. 9 c depicts a table which provides adjustments to current values ofLLRs used by a decoder for each bit of a code word based on a secondread result. When a second or other additional read operation isperformed, the LLR values can be adjusted. In one possible approach,adjustments are made to the LLR values which are currently used by thedecoder after having started decoding a first read result. As explainedfurther in connection, e.g., with FIGS. 12 and 13, iterativeprobabilistic decoding involves applying parity checks of an errorcorrection code to the read codeword. If a parity check fails, thedecoder adjusts the LLR values in a direction toward satisfying theparity check. This process can be repeated in successive iterations.Sometimes the adjustments end up in an incorrect bit being flipped andthe parity check being satisfied. In this case, the next parity check isperformed, if applicable. An adjustment can thus be made to the current(most recently used) values of the LLRs while decoding is taking place.

Generally, if the second read value is consistent with the first readvalue, on a per bit basis, the current LLR can be increased in magnitudeto indicate that the bit has a greater reliability. For example, if thefirst read is Y1=state 7 (code word 1011) and the second read isY2=state 6 (code word 1010) the LLR can be adjusted for the top bit toindicate a greater probability that the bit is 1. Note that the initialLLR for the bit was −M1 based on the first read (FIG. 9 b), but thedecoding process may have changed this value to another negative or evenpositive value. The adjustment is applied to the current value, in thisimplementation. For example, the LLR may currently be −M2 in which iscase it might be adjusted to a negative value greater in magnitude thanM2. Or, the LLR may currently be +4 in which is case it might beadjusted to +1.

Note that the adjustment can be expressed in different ways, e.g., by aconstant added or subtracted, or by a function. A table need not beused. For example, the adjustment can be made based on the magnitude ofthe LLR. It may not be necessary to adjust an LLR with a highermagnitude, or a relatively smaller adjustment may be made in such ascase. Conversely, a relatively larger adjustment may be made when theLLR has a smaller magnitude. Generally, the adjustment can be based onfactors such as how close the decoding process is to converging (e.g.,based on the number of iterations and/or number of parity checkssatisfied), the present values of the LLRs and/or the second or otheradditional read state. Testing of different adjustments can also beperformed to determine satisfactory adjustments. The specificadjustments used can be tailored to the specific memory deviceimplementation.

FIG. 10 a-d depict tables which provide initial values of LLRs for eachbit of a code word based on first and second read results. In oneapproach, the initial values of the LLRs which are used in the decodingprocess can be set based on the results of multiple read operations. Aseparate table can be provided for each bit position of the code words.For example, FIGS. 10 a, 10 b, 10 and 10 d provide LLR values for a topbit, higher bit, upper bit and lower bit, respectively. Each table canbe read based on two read results Y1 and Y2, for instance. After readingeach table, an initial LLR for each bit is provided to the decodingprocess. Note that the tables can have three or more dimensions if threeor more read operations are used.

FIG. 11 a is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on first and second readoperations. Step 1100 includes beginning a first read operation. A readoperation can include sensing whether the V_(TH) of a storage element isabove or below a number of compare points (step 1102). Some of thecomparison points can result in hard bits, e.g., for comparison pointsthat separate V_(TH) ranges of programming states, and some of thecomparison points can result in soft bits, e.g., for comparison pointsthat bisect a V_(TH) range of a programming state. In one approach, theread operation can use a first set of compare points followed by asecond set of compare points which bisect the first set.

Each compare point determination can be considered to be a senseoperation as can the read operation as a whole. In practice, a number ofstorage elements may be read during the read operation. For example, theerror correction coding may be applied over a number of storageelements, in which case read results are obtained from those storageelements for use in the decoding. Based on the sensing, the programmingstates of the storage elements are determined (step 1104) and code wordsare assigned based on the programming states (step 1106). For example,the code words or bit assignments of FIG. 9 a may be used when there aresixteen states. A second read operation begins at step 1108 such as byagain sensing whether the V_(TH) of a storage element is above or belowthe compare points (step 1110). Based on the sensing, the programmingstates of the storage elements are again determined (step 1112).

Step 1114 includes assigning initial probability metrics to each bit inthe code words, where the metrics indicate a reliability of the bitbased on the first and second read results. For example, this step caninvolve reading the tables of FIGS. 10 a-10 d to obtain LLRs, althoughother probabilistic metrics can be used as well. Step 1116 includesperforming iterative decoding using the initial probability metrics, andadjusting the probability metrics in successive iterations. After theiterations of step 1116, if the decoding converges, e.g., all paritychecks of the error correction code are satisfied, at decision step1118, the decoded code words are stored as the final read result (step1120). Note that the code words which are associated with the errorcorrection process can be decoded at the same time when the paritychecks extend over the code words. Alternatively, it is possible for asingle code word to be decoded by itself when one or more parity checksinvolve only that code word. If the decoding does not converge, an erroris declared or an additional read operation can be performed, forinstance, at step 1122.

FIG. 11 b is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on a first read operation, thenadjusted probability metrics are adjusted further based on a second readoperation. As discussed, in one implementation, the decoding process canbe temporarily paused so that the current LLR values, which are adjustedrelative to the initial LLR values, are adjusted further based on one ormore additional read results. Steps 1130, 1132, 1134 and 1136 correspondto steps 1100, 1102, 1104 and 1106, respectively, of FIG. 11 a. Step1138 includes assigning initial probability metrics to each bit in thecode words, where the metrics indicate a reliability of the bit based onthe first read results. For example, this step can involve reading thetable of FIG. 9 b to obtain LLRs, although other probabilistic metricscan be used as well. Step 1140 includes performing iterative decodingusing the initial probability metrics, and adjusting the probabilitymetrics in successive iterations. If the decoding converges within agiven time period, e.g., elapsed time, and/or a given number ofiterations, at decision step 1142, the decoded code words are stored asthe final read result (step 1144). If the decoding progresses towardconverging, such as by satisfying a specified number of parity checks,the decoding continues. Appropriate software, hardware and/or firmwarecan be provided in the decoder to enforce this provision.

If the decoding does not converge or progress toward convergence, thedecoding is adjusted. The current values of the probability metrics arestored at step 1146, and a second read operation is begun at step 1148.Steps 1150 and 1152 correspond to steps 1132 and 1134, respectively.Step 1154 includes adjusting the current values of the probabilitymetrics based on the second read. For example, this can include applyingthe LLR adjustments depicted by the table of FIG. 9 c. At step 1156, theiterative decoding continues using the adjusted values of theprobability metrics, which can be adjusted further in subsequentiterations. The decoding process is improved due to the informationprovided by the second read. For example, the decoding process mayconverge sooner than if only results from a single read were used, orthe decoding process may converge where the singe read case would notconverge. At decision step 1158, the decoding status is again checked,similar to the check of decision step 1142. If the decoding convergeswithin a given time period, e.g., elapsed time, and/or a given number ofiterations, at decision step 1158, the decoded code words are stored asthe final read result (step 1160). Note that the metric for progressiontoward converging in step 1158 can be more lax than at step 1142. If thedecoding progresses toward converging, such as by satisfying a specifiednumber of parity checks, the decoding continues. If the decoding doesnot meet the second condition, an error can be declared or an additionalread operation can be performed (step 1162), and the results of thatread operation used to adjust the decoding process again.

Note that, instead of beginning an additional read operation in responseto the decoding not meeting a certain condition, it is possible toperform the additional read operation automatically after the decodingprocess has started based on the first read operation. In this case, thedecoding process can be paused to update the LLRs when the second readoperation has been completed, regardless of whether the decoding meets acertain condition, or the results of the second operation can be storedfor subsequent use in the decoding process, if necessary.

FIG. 11 c is a flowchart of a process for decoding a code word whichrepresents a state of a non-volatile storage element, where initialprobability metrics are obtained based on a first read operation, thennew initial probability metrics are obtained based on the first readoperation and a second read operation. In this approach, the decoding isessentially restarted from the beginning when it does not meet a certaincondition based on initial LLRs obtained from a first read. However, thenew initial LLRs are based on both the first and second read operations,or other additional read operations. Steps 1170, 1172, 1174, 1176, 1178,1180, 1182 and 1184 correspond to steps 1130, 1132, 1134, 1136, 1138,1140, 1142 and 1144, respectively, of FIG. 11 b. At step 1186, when thedecoding does not meet the first condition at decision step 1182, anerror can be declared or an additional read can be performed. If asecond read operation is to be used (step 1188), the current values ofthe probability metrics, e.g., LLRs, which are used by the decoder arediscarded at step 1190. Steps 1192 and 1194 correspond to steps 1150 and1152, respectively, of FIG. 11 b. At step 1196, new initial probabilitymetrics are assigned to each bit in the code words, e.g., the code wordsassigned in step 1176. These new probability metrics indicate thereliability of the bits based on the first and second read operations.

FIG. 12 depicts a sparse parity check matrix. As mentioned previously,the storage elements store data which represents information bits andparity bits, where the parity bits are provided according to an errorcorrection coding process. Such a process involves adding parity bits toinformation bits. In one possible approach, a low density parity check(LDPC) code is used. In practice, such codes are typically applied tomultiple code words which are encoded across a number of storageelements. LDPC codes are desirable because they incur a relatively lowoverhead cost. Moreover, LDPC codes exhibit a performance near theShannon limit under iterative message-passing decoding algorithms.However, this is an example implementation only, as any type of errorcorrection code can be used. For example, other linear block codes maybe used.

An LDPC code is a linear block code which is characterized by a sparseparity check matrix, e.g., as depicted by the matrix H 1200. The matrixincludes K information bits and M parity bits, and the code length isN=K+M. Further, the parity bits are defined such that M parity checkequations are satisfied, where each row of the matrix represents aparity check equation. In particular, the rows of the matrix areidentified by check nodes cn1 through cn10 and the columns areidentified by variables v1 through v13, which indicate the data that isstored in the storage elements, e.g., the code word bits. This dataincludes information bits i and parity bits p, based on the equation:

${{H \cdot \overset{\_}{v}} = {{H \cdot \left\lbrack \frac{\overset{\_}{i}}{\overset{\_}{p}} \right\rbrack} = 0}},$where H is the sparse parity check matrix, v is the data matrix, ī isthe information bit matrix and p is the parity bit matrix. Theinformation bits can be taken from different bit positions of differentcode words, in one approach. The data matrix v can be determined bysolving the above equation. Further, this can be done efficiently usinga Gaussian elimination procedure if the matrix H is lower triangular.

FIG. 13 depicts a sparse bipartite graph which corresponds to the sparseparity check matrix of FIG. 12. The graph 1300 indicates in furtherdetail how the LDPC code works. The variable nodes v1 through v13represent the code word bits and the check nodes cn1 through cn10represent the parity check constraints on the bits.

During decoding, the decoder attempts to satisfy the parity checks. Inthis example, there are ten parity checks as indicated by the checknodes cn1 through cn10. The first parity check at cn1 determines ifv2{circle around (×)}v4{circle around (×)}v11{circle around (×)}v13=0,where {circle around (×)} denotes the exclusive-or (XOR) logicaloperation. This check is satisfied if there is an even number of “1”bits in v2, v4, v11 and v13. This check is denoted by the fact thatarrows from nodes v2, v4, v11 and v13 point to node cn1 in the graph1300. The second parity check at cn2 determines if v1{circle around(×)}v7{circle around (×)}v12=0, which is satisfied if there is an oddnumber of “1” bits. The third parity check at cn3 determines ifv3{circle around (×)}v5{circle around (×)}v6{circle around (×)}v9®v10=0,which is satisfied if there is an odd number of “1” bits. Similarly, thefourth parity check at cn4 determines if v2{circle around (×)}v8{circlearound (×)}v11=0, the fifth parity check at cn5 determines if v4{circlearound (×)}v7{circle around (×)}v12=0, the sixth parity check at cn6determines if v1{circle around (×)}v5{circle around (×)}v6{circle around(×)}v9=0, the seventh parity check at cn7 determines if v2{circle around(×)}v8{circle around (×)}v10{circle around (×)}v13=0, the eighth paritycheck at cn8 determines if v4{circle around (×)}v7{circle around(×)}v11{circle around (×)}v12=0, the ninth parity check at cn9determines if v1{circle around (×)}v3{circle around (×)}v5{circle around(×)}v13{circle around (×)}=0 and the tenth parity check at cn01determines if v7{circle around (×)}v8{circle around (×)}v9{circle around(×)}v10=0.

The decoding process for LDPC is an iterative probabilistic decodingprocess known as iterative message passing decoding. The iteratinginvolves serially traversing the check nodes and updating the LLR valuesof the bits involved based on each parity check. In one approach, anattempt is made to satisfy the first parity check of cn1. Once thatparity check is satisfied, an attempt is made to satisfy the firstparity check of cn2 and so forth. The LLR values are adjusted, ifnecessary, for each iteration in a manner known to those skilled in theart. This iterative algorithm is a form of belief propagation.

Use of Pre-Conditioning Waveforms

The sensed programming state of a storage element can vary overdifferent read operations based on the history of the storage element.For example, the state of a storage element will sometimes changebetween two read operations, and the number of storage elements changingtheir states can depend on the history of the control gate voltages. Thetendency for a read state to change is based on a variety of factorsincluding the V_(TH) width for each state, the spacing between states,trap site noise and other factors.

It can be useful, therefore, to intentionally create different shortterms histories when two or more sensing operations are performed. Inone approach, the different short terms histories are used at the samevoltage level, or at neighboring voltage levels. For example, just priorto a first read operation, the selected word line can be grounded. Tomake a second read operation's short term history different, we canapply a pre-conditioning waveform in the form of a read pass voltage,e.g., V_(READ)=5.5 V, for instance, to the selected word line justbefore the onset of the second read operation. V_(READ) is the voltagewhich is typically applied to unselected word lines when storageelements on a selected word line are being read. However, this is oneexample among many possible implementations. For example, the amplitude,duration and shape, in addition to the time interval between the end ofthe pre-conditioning waveform and the onset of the integration time ofsensing, are all parameters that can be optimized, e.g., with the RCtime constants of the word lines and other lines in mind.

To the extent that the V_(TH) of some noisy storage elements depends onthe short term history of the biases applied to their various terminals,such as the control gate and the C-P-well, the differing short termhistories should increase the noisy behavior, and this increase willhelp us identify more suspect bits. This additional information shouldimprove the iterative decoding process by identifying noisy storageelements and acknowledging the uncertainty about the value of certainbits. This acknowledgment of ignorance can be used to help focus theattention of the decoding process on the more troublesome bits wheremore attention is needed. In short, it is better to know that we do notknow the value of some bits than to pretend that we do.

The use of a pre-conditioning waveform prior to the sense operationallows the history of the read operation to more closely resemble thehistory of the verify operation performed during programming, as theshort term history of the verify operation included a program (V_(PGM))pulse. Example V_(PGM) pulses, whose amplitude varies between, e.g.,13-20 V, are depicted in FIG. 24. The amplitude of the pre-conditioningwaveform need not be as high as the amplitude of the program pulse, butit can still duplicate the short term history that a storage elementundergoes when a verify operating is performed as part of theprogramming operation. Thus, the effect that a program pulse has on astorage element prior to a verify operation is replicated in part by theeffect that a pre-conditioning pulse has on a storage element prior to aread operation. For implementations where additional read operations areperformed only when necessary, e.g., when the decoding process based onone read operation is not converging, the first read can use thepre-conditioning waveform prior to the sense operation, in one possibleapproach.

Further, the probability metrics, such as LLRs, which are used in thedecoding process can account for the effects of pre-conditioningwaveforms. Thus, the initial LLR assigned to a bit can vary based on thehistory of the associated storage element. For example, if the same bitvalue in a codeword is obtained from read operations with and withoutpre-conditioning waveforms, the magnitude of the LLR for the bit shouldbe higher to indicate a more confident measure of the bit's value thanif the bit value was obtained from read operations both withoutpre-conditioning waveforms, or even both with pre-conditioningwaveforms.

In particular, the pre-conditioning waveforms can affect the probabilitydistribution functions (pdfs) of the various programming states. Thiseffect is measurable by comparing the pdf of the V_(TH) distribution ofa set of storage elements with and without pre-conditioning waveforms.Generally, with the use of pre-conditioning waveforms, a mathematicalmethod for aggregating the LLR results from multiple reads can beprovided based on the behavior of the storage elements for a particulartechnology. For example, if every one of the reads yields the sameresult, then we can assign a high-magnitude LLR for the bits of the codeword which represent the state. This peak LLR (LLRpeak) would be higherthan the LLR that is used for only a single read (LLRsingle). On theother hand, if the series of reads yields differing results, then alower final LLR would result. One method of aggregating a series ofreads is to take the average of the single-read LLRs, and thenmultiplying the average by a normalization factor such asLLRpeak/LLRsingle.

Examples below depict one or more pre-conditioning pulses occurringbefore the sensing operations of a read operation, but, in general, anypre-conditioning waveform can be used. Further, a pre-conditioningwaveform can be applied to a terminal of a storage element, e.g.,control gate, source and/or drain, and/or to a substrate on which thestorage element is formed. For instance, a first pre-conditioningwaveform can be applied to a non-volatile storage element via a body ofa substrate on which the non-volatile storage element is formed, and asecond pre-conditioning waveform can be applied to a control gate, asource and a drain of the storage element. Or, the secondpre-conditioning waveform can be applied to a non-volatile storageelement via the body, and the first pre-conditioning waveform applied tothe control gate, source and/or the drain of the storage element.Further, pre-conditioning waveforms with different characteristics canbe applied during one read operation or during different readoperations.

Moreover, the pre-conditioning waveform can be used in a single readapproach as well as a multiple read approach. Further, thepre-conditioning waveform can be used with any type of error correctiondecoding, or without error correction decoding.

FIG. 14 a is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a selected word line before an associated readpulse. The waveforms in this and the other timing diagrams are notnecessarily to scale.

As mentioned, a pre-conditioning waveform can be applied to a storageelement as part of the read operation. This can involve applying apre-conditioning waveform such as a pulse to the control gates of thestorage elements being read via the associated selected word line, forinstance, prior to applying a sensing voltage to the word line forcomparing the V_(TH) of the storage elements to a compare point.Moreover, a preconditioning waveform can be applied just prior to anyread or just prior to some of the read operations. It can be combinedwith various levels of soft reads. It can also be combined with themultiple reads at the same voltage level.

In general, during read and verify operations, the selected word line orother control line is connected to a voltage, a level of which isspecified for each read and verify operation, in order to determinewhether a threshold voltage of the concerned storage element has reachedsuch level. After applying the word line voltage, the conduction currentof the storage element is measured to determine whether the storageelement turned on. If the conduction current is measured to be greaterthan a certain value, then it is assumed that the storage element turnedon and the voltage applied to the word line is greater than thethreshold voltage of the storage element. If the conduction current isnot measured to be greater than the certain value, then it is assumedthat the storage element did not turn on and the voltage applied to theword line is not greater than the threshold voltage of the storageelement.

There are many ways to measure the conduction current of a storageelement during a read or verify operation. In one example, theconduction current of a storage element is measured by the rate itallows (or fails to allow) the NAND string that included the storageelement to discharge the bit line. The charge on the bit line ismeasured after a period of time to see whether it has been discharged ornot. In another embodiment, the conduction of the selected storageelement allows current to flow or not flow on a bit line, which ismeasured by whether a capacitor in the sense amplifier is charged due tothe flow of current. Both examples are discussed.

In particular, waveform 1400 depicts a drain side select gate voltage(SGD), waveform 1402 depicts an unselected word line voltage, waveform1404 depicts a selected word line voltage (of the word line selected forreading/verification), waveform 1410 depicts a source side select gate(SGS) voltage (option 1), waveform 1412 depicts a SGS voltage (option2), waveform 1414 depicts a selected bit line (BL) voltage (option 1)(of the bit line selected for reading/verification), waveform 1418depicts a selected BL voltage (option 2) and waveform 1419 depicts asource voltage. Additionally, time points t0-t4 extend in the horizontaldirection.

Note that there are two versions of SGS and Selected BL depicted. Optiondepicts a read/verify operation for an array of storage elements thatmeasure the conduction current of a storage element by determiningwhether the bit line has discharged. Option 2 depicts a read/verifyoperation for an array of storage elements that measure the conductioncurrent of a storage element by the rate it discharges a dedicatedcapacitor in the sense amplifier.

First, the behavior of the sensing circuits and the array of storageelements that are involved in measuring the conduction current of astorage element by determining whether the bit line has discharged willbe discussed with respect to option 1.

Prior to t0, the voltages start at a steady state voltage, Vss, ofapproximately 0 V. Between t0 and t1, a pre-conditioning waveform 1406is applied to the selected word line prior to the read pulse 1408. Notethat, in another approach, the pre-conditioning waveform or waveformscan overlap with the sense operation (read pulse 1408). At t2, SGD andSGS (option 2) are raised to V_(SGD) and V_(SGS), respectively (e.g.,3.5 V). The unselected word lines are raised to V_(READ) (e.g., 6 V),which acts as an overdrive voltage because it causes the unselectedstorage elements to turn on and act as pass gates. The pre-conditioningwaveform 1406 can have an amplitude comparable to V_(READ), forinstance, which is greater than V_(CGR). V_(READ) is generally thehighest voltage which can be applied without causing disturbs. Theselected word line is raised to V_(CGR) (control gate read voltage) fora read operation or to a verify level for a verify operation. Thewaveform on the selected word line between t2 and t4 is considered to bea read pulse 1408 which is used during a sense operation. The selectedBL (option 1) is pre-charged to approximately 0.7 V, in one approach.

The process depicted in FIG. 14 a can then be repeated at the next reador verify level, in which a different V_(CGR) is applied to sensewhether the V_(TH) of the storage elements associated with the selectedword line is above or below a corresponding compare point. In oneapproach, the pre-condition pulse is provided before each senseoperation.

At t3, the NAND string can control the bit line. Also at t3, the sourceside select gate is turned on by raising SGS (option 1) to V_(SGS). Thisprovides a path to dissipate the charge on the bit line. If the V_(TH)of the storage element selected for reading is greater than V_(CGR) orthe verify level applied to the selected word line, then the selectedstorage element will not turn on and the bit line will not discharge, asdepicted by line 1415. If the threshold voltage in the storage elementselected for reading is below V_(CGR) or below the verify level appliedto the selected word line, then the storage element selected for readingwill turn on (conduct) and the bit line voltage will dissipate, asdepicted by curve 1416. At some point after time t3 and prior to time t4(as determined by the particular implementation), the sense amplifierwill determine whether the bit line has dissipated a sufficient amount.In between t3 and t4, the sense amplifier measures the evaluated BLvoltage. At time t4, the depicted waveforms will be lowered to Vss (oranother value for standby or recovery).

Discussed next, with respect to option 2, is the behavior of the sensingcircuits and the array of storage elements that measure the conductioncurrent of a storage element by the rate at which it charges a dedicatedcapacitor in the sense amplifier. The pre-conditioning waveform 1406 isapplied between t0 and t1 as before. At time t2, SGD is raised toV_(SGD), the unselected word lines are raised to V_(READ), and theselected word line is raised to V_(CGR) for a read operation or to averify level for a verify operation. In this case, the sense amplifierholds the bit line voltage constant regardless of what the NAND sting isdoing, so the sense amplifier measures the current flowing with the bitline “clamped” to that voltage. At some point after t2 and prior to t4(as determined by the particular implementation), the sense amplifierwill determine whether the capacitor in the sense amplifier hasdissipated a sufficient amount. At t4, the depicted waveforms will belowered to Vss (or another value for standby or recovery). Note that inother embodiments, the timing of some of the waveforms can be changed.

FIG. 14 b is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where one or morepre-conditioning waveforms are applied to a selected word line beforeassociated read pulses. Here, three sense operations which make up partof a read operation are depicted. For example, with sixteen states,there would be fifteen sense operations. Waveforms 1420, 1422, 1424,1435, 1436, 1437, 1438 and 1439 represent the waveforms 1400, 1402,1404, 1410, 1412, 1414, 1418 and 1419, respectively, of FIG. 14 a overthree sense operations. Waveform 1432 is analogous to waveform 1424 butincludes the pre-conditioning waveform 1426 only before the first readpulse 1427 in the series of read pulses which make up a read operation.In waveform 1424, pre-conditioning waveforms 1426, 1428 and 1430 areapplied to the selected word line prior to read pulses 1427, 1429 and1431, respectively, which are associated with different V_(TH) comparepoints. The read pulse amplitude increases in each sense operation,e.g., from V_(CGR-1) to V_(CGR-2) to V_(CGR-3) and so forth. Theamplitude of the pre-conditioning waveforms can be the same for eachsense operation or can vary, e.g., with the amplitude of the associatedread pulse. Regarding waveform 1432, note that, generally, thepre-conditioning waveform can be provided before one or more selectedread pulses in a read operation. Moreover, the read pulses which haveassociated pre-conditioning waveforms can be selected randomly.

FIG. 14 c is a timing diagram that depicts different pre-conditioningwaveforms. As mentioned, the pre-conditioning waveform can have avariety of characteristics. Waveform 1404 (discussed also in FIG. 14 a)includes a baseline pre-conditioning waveform 1406 as a pulse which isprovided before a read pulse 1408. Note that the waveform 1406 and thepulse 1408 show a rise time and a decay time which will vary fordifferent memory devices. In one option, waveform 1440 includes apre-conditioning waveform 1441 as a pulse with a longer durationcompared to pulse 1406. Waveform 1442 includes a pre-conditioningwaveform 1443 which is closer to the read pulse 1408 compared to pulse1406. The time period at issue extends from the end of thepre-conditioning waveform to the start of the read pulse. Waveform 1444includes a pre-conditioning waveform 1445 as a pulse with a loweramplitude, compared to pulse 1406. Waveform 1446 includes apre-conditioning waveform 1447 with a different shape, compared to pulse1406. Here, an oscillating waveform is used. While this waveform doesnot replicate a programming pulse, and even includes negative voltages,it can still provide useful, e.g., in causing trap site activity. Otherwaveforms including ramps, steps and so forth can also be used. It isalso possible to provide multiple pre-conditioning waveforms before aread pulse.

FIG. 14 d is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a drain of a selected storage element via aselected bit line before an associated read pulse. Generally, instead ofapplying a voltage waveform to the control gate of the storage elementsvia the selected word line, a voltage waveform can be applied to anyother terminal, such as a source or drain, or via a body bias. In thisapproach, the drain terminal is accessed via the bit line associatedwith a NAND chain. As depicted by waveform 1450, the drain side selectgate is opened by raising V_(SGD) between t0 and t1, at which time thepre-conditioning waveform 1456 or 1460 is applied (waveforms 1454 and1458, respectively) to the bit line. The drain side select gate can beopened just before t0 until just after t1 to envelope thepre-conditioning waveform 1460. The read pulse 1408 occurs between t2and t4, as discussed previously. The drain side select gate is alsoopened between t2 and t4 during the read pulse. Note that the drain sideselect gate can be kept open between t0 and t4 continuously as well.

FIG. 14 e is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a source of a selected storage element via asource line before an associated read pulse. In this approach, thesource terminal is accessed via the source line associated with a NANDchain. As depicted by waveforms 1462 or 1464, the source side selectgate is opened by raising V_(SGS) between t0 and t1, at which time thepre-conditioning waveform 1468 in waveform 1466 is applied on the sourceline. The source side select gate can be opened just before t0 untiljust after t1 to envelope the pre-conditioning waveform 1468. The readpulse 1408 occurs between t2 and t4, as discussed previously. The sourceside select gate is also opened between t2 and t4 during the read pulse.Note that the source side select gate can be kept open between t0 and t4continuously as well.

FIG. 14 f is a timing diagram that explains the behavior of certainwaveforms during read/verify operations, where a pre-conditioningwaveform is applied to a selected storage element via a body bias beforean associated read pulse. In this approach, the substrate which thestorage elements are formed on is biased with a body bias voltage V_(B).For example, a p-well of the substrate can be biased with V_(B)<0 toincrease the control gate to body voltage. As depicted by waveform 1470,the pre-conditioning waveform 1472 can be applied to the body between t0and t1. The read pulse 1408 occurs between t2 and t4, as discussedpreviously.

FIG. 14 g is a flowchart of a process for performing a read operation ona storage element, where pre-conditioning waveforms are applied to astorage element before associated read pulses. The read operation beginsat step 1474. An index k, representing a programmed state, isinitialized to zero at step 1475. A pre-conditioning waveform is appliedat step 1476 and a read pulse is applied at step 1477. Step 1478includes sensing whether the V_(TH) is above or below a read comparepoint. If there is a next compare point, at decision step 1479, theindex is incremented at step 1480, and steps 1476-1478 are repeated. Ifthere is no further compare point, the read operation has completed andthe results are stored at step 1481.

FIG. 14 h is a flowchart of a process for performing a read operation ona storage element, where a pre-conditioning waveform is applied to astorage element before a series of read pulses. The read operationbegins at step 1482. An index k, representing a programmed state, isinitialized to zero at step 1483. A pre-conditioning waveform is appliedat step 1484 and a read pulse is applied at step 1485. Step 1486includes sensing whether the V_(TH) is above or below a read comparepoint. If there is a next compare point, at decision step 1487, theindex is incremented at step 1488, and steps 1485 and 1486 are repeated.If there is no further compare point, the read operation has completedand the results are stored at step 1489. In this case, thepre-conditioning pulse is not used again for the read operation.

FIG. 14 i is a flowchart of a process for obtaining reliability metricsusing pre-conditioning waveforms for subsequent use in decoding. Seealso FIGS. 5 a, 5 b and 8. Step 1490 includes programming random data tostorage elements, and step 1491 initializes an index k to state 0. State1492 includes performing measurements of V_(TH) for each storage elementwritten to state k, sometimes with the pre-conditioning waveform andsometimes without, and/or with different waveform characteristics (e.g.,amplitude, duration and timing). In this manner, electrons in thestorage elements are influenced to enter and exit trap sites, forinstance, to provide a more accurate V_(TH) distribution. Step 1493includes obtaining reliability metrics (e.g., LLRs) for state k based onthe measurements and step 1494 includes storing the results. If there isa next state at decision step 1495, k is incremented at step 1496 andsteps 1492 and 1493 are repeated. If there is no next state, the processends at step 1497.

Thus, the pre-conditioning waveform can be used in a test environmentwhen reliability metrics are being developed for a non-volatile storageand/or in a production environment when the non-volatile storage isbeing read. In one approach, tables similar to those discussed inconnection with FIGS. 9 b-10 d can be developed with and without the useof pre-conditioning pulses, and/or with the use of differentpre-conditioning pulses. These tables are then accessed during theproduction environment when the non-volatile storage is being read, asdiscussed previously.

FIG. 15 illustrates an example of an array 1500 of NAND storageelements, such as those shown in FIGS. 1 and 2. Along each column, a bitline 1506 is coupled to the drain terminal 1526 of the drain select gatefor the NAND string 1550. Along each row of NAND strings, a source line1504 may connect all the source terminals 1528 of the source selectgates of the NAND strings. The NAND strings and storage elements areformed on a p-well 1505 which can receive a body bias V_(B) in someimplementations. An example of a NAND architecture array and itsoperation as part of a memory system is found in U.S. Pat. Nos.5,570,315; 5,774,397; and 6,046,935.

The array of storage elements is divided into a large number of blocksof storage elements. As is common for flash EEPROM systems, the block isthe unit of erase. That is, each block contains the minimum number ofstorage elements that are erased together. Each block is typicallydivided into a number of pages. A page is a unit of programming. In oneembodiment, the individual pages may be divided into segments and thesegments may contain the fewest number of storage elements that arewritten at one time as a basic programming operation. One or more pagesof data are typically stored in one row of storage elements. A page canstore one or more sectors. A sector includes user data and overheaddata. Overhead data typically includes an Error Correction Code (ECC)that has been calculated from the user data of the sector. A portion ofthe controller (described below) calculates the ECC when data is beingprogrammed into the array, and also checks it when data is being readfrom the array. Alternatively, the ECCs and/or other overhead data arestored in different pages, or even different blocks, than the user datato which they pertain.

A sector of user data is typically 512 bytes, corresponding to the sizeof a sector in magnetic disk drives. Overhead data is typically anadditional 16-20 bytes. A large number of pages form a block, anywherefrom 8 pages, for example, up to 32, 64, 128 or more pages. In someembodiments, a row of NAND strings comprises a block.

Memory storage elements are erased in one embodiment by raising thep-well 1505 on which the NAND strings and storage elements are formed toan erase voltage (e.g., 20 V) for a sufficient period of time andgrounding the word lines of a selected block while the source and bitlines are floating. Due to capacitive coupling, the unselected wordlines, bit lines, select lines, and c-source are also raised to asignificant fraction of the erase voltage. A strong electric field isthus applied to the tunnel oxide layers of selected storage elements andthe data of the selected storage elements are erased as electrons of thefloating gates are emitted to the substrate side, typically byFowler-Nordheim tunneling mechanism. As electrons are transferred fromthe floating gate to the p-well region, the threshold voltage of aselected storage element is lowered. Erasing can be performed on theentire memory array, separate blocks, or another unit of storageelements.

FIG. 16 is a block diagram of a non-volatile memory system using singlerow/column decoders and read/write circuits. The diagram illustrates amemory device 1696 having read/write circuits for reading andprogramming a page of storage elements in parallel, according to oneembodiment of the present invention. Memory device 1696 may include oneor more memory die 1698. Memory die 1698 includes a two-dimensionalarray of storage elements 1500, control circuitry 1610, and read/writecircuits 1665. In some embodiments, the array of storage elements can bethree dimensional. The memory array 1500 is addressable by word linesvia a row decoder 1630 and by bit lines via a column decoder 1660. Theread/write circuits 1665 include multiple sense blocks 1600 and allow apage of storage elements to be read or programmed in parallel. Typicallya controller 1650 is included in the same memory device 1696 (e.g., aremovable storage card) as the one or more memory die 1698. Thecontroller 1650 may include the ECC decoding capability discussedherein. Commands and Data are transferred between the host andcontroller 1650 via lines 1620 and between the controller and the one ormore memory die 1698 via lines 1618.

The control circuitry 1610 cooperates with the read/write circuits 1665to perform memory operations on the memory array 1500. The controlcircuitry 1610 includes a state machine 1612, an on-chip address decoder1614, a boost control 1615 and a power control module 1616. The statemachine 1612 provides chip-level control of memory operations. Theon-chip address decoder 1614 provides an address interface between thatused by the host or a memory controller to the hardware address used bythe decoders 1630 and 1660. The boost control 1615 can be used forsetting a boost mode, including determining a timing for initiatingsource side and drain side boosting, as discussed herein. The powercontrol module 1616 controls the power and voltages supplied to the wordlines and bit lines during memory operations.

In some implementations, some of the components of FIG. 16 can becombined. In various designs, one or more of the components (alone or incombination), other than storage element array 1500, can be thought ofas a managing circuit. For example, one or more managing circuits mayinclude any one of or a combination of control circuitry 1610, statemachine 1612, decoders 1614/1660, power control 1616, sense blocks 1600,read/write circuits 1665, controller 1650, etc.

FIG. 17 is a block diagram of a non-volatile memory system using dualrow/column decoders and read/write circuits. Here, another arrangementof the memory device 1696 shown in FIG. 16 is provided. Access to thememory array 1500 by the various peripheral circuits is implemented in asymmetric fashion, on opposite sides of the array, so that the densitiesof access lines and circuitry on each side are reduced by half. Thus,the row decoder is split into row decoders 1630A and 1630B and thecolumn decoder into column decoders 1660A and 1660B. Similarly, theread/write circuits are split into read/write circuits 1665A connectingto bit lines from the bottom and read/write circuits 1665B connecting tobit lines from the top of the array 1500. In this way, the density ofthe read/write modules is essentially reduced by one half. The device ofFIG. 17 can also include a controller, as described above for the deviceof FIG. 16.

FIG. 18 is a block diagram depicting one embodiment of a sense block. Anindividual sense block 1600 is partitioned into a core portion, referredto as a sense module 1680, and a common portion 1690. In one embodiment,there will be a separate sense module 1680 for each bit line and onecommon portion 1690 for a set of multiple sense modules 1680. In oneexample, a sense block will include one common portion 1690 and eightsense modules 1680. Each of the sense modules in a group willcommunicate with the associated common portion via a data bus 1672. Forfurther details refer to U.S. Patent Application Pub No. 2006/0140007,titled “Non-Volatile Memory and Method with Shared Processing for anAggregate of Sense Amplifiers” published Jun. 29, 2006, and incorporatedherein by reference in its entirety.

Sense module 1680 comprises sense circuitry 1670 that determines whethera conduction current in a connected bit line is above or below apredetermined threshold level. Sense module 1680 also includes a bitline latch 1682 that is used to set a voltage condition on the connectedbit line. For example, a predetermined state latched in bit line latch1682 will result in the connected bit line being pulled to a statedesignating program inhibit (e.g., V_(dd)).

Common portion 1690 comprises a processor 1692, a set of data latches1694 and an I/O Interface 1696 coupled between the set of data latches1694 and data bus 1620. Processor 1692 performs computations. Forexample, one of its functions is to determine the data stored in thesensed storage element and store the determined data in the set of datalatches. The set of data latches 1694 is used to store data bitsdetermined by processor 1692 during a read operation. It is also used tostore data bits imported from the data bus 1620 during a programoperation. The imported data bits represent write data meant to beprogrammed into the memory. I/O interface 1696 provides an interfacebetween data latches 1694 and the data bus 1620.

During read or sensing, the operation of the system is under the controlof state machine 1612 that controls the supply of different control gatevoltages to the addressed storage element. As it steps through thevarious predefined control gate voltages corresponding to the variousmemory states supported by the memory, the sense module 1680 may trip atone of these voltages and an output will be provided from sense module1680 to processor 1692 via bus 1672. At that point, processor 1692determines the resultant memory state by consideration of the trippingevent(s) of the sense module and the information about the appliedcontrol gate voltage from the state machine via input lines 1693. Itthen computes a binary encoding (code word) for the memory state andstores the resultant data bits into data latches 1694. In anotherembodiment of the core portion, bit line latch 1682 serves double duty,both as a latch for latching the output of the sense module 1680 andalso as a bit line latch as described above.

It is anticipated that some implementations will include multipleprocessors 1692. In one embodiment, each processor 1692 will include anoutput line (not depicted) such that each of the output lines iswired-OR'd together. In some embodiments, the output lines are invertedprior to being connected to the wired-OR line. This configurationenables a quick determination during the program verification process ofwhen the programming process has completed because the state machinereceiving the wired-OR can determine when all bits being programmed havereached the desired level. For example, when each bit has reached itsdesired level, a logic zero for that bit will be sent to the wired-ORline (or a data one is inverted). When all bits output a data 0 (or adata one inverted), then the state machine knows to terminate theprogramming process. Because each processor communicates with eightsense modules, the state machine needs to read the wired-OR line eighttimes, or logic is added to processor 1692 to accumulate the results ofthe associated bit lines such that the state machine need only read thewired-OR line one time. Similarly, by choosing the logic levelscorrectly, the global state machine can detect when the first bitchanges its state and change the algorithms accordingly.

During program or verify, the data to be programmed is stored in the setof data latches 1694 from the data bus 1620. The program operation,under the control of the state machine, comprises a series ofprogramming voltage pulses applied to the control gates of the addressedstorage elements. Each programming pulse is followed by a read back(verify) to determine if the storage element has been programmed to thedesired memory state. Processor 1692 monitors the read back memory staterelative to the desired memory state. When the two are in agreement, theprocessor 1692 sets the bit line latch 1682 so as to cause the bit lineto be pulled to a state designating program inhibit. This inhibits thestorage element coupled to the bit line from further programming even ifprogramming pulses appear on its control gate. In other embodiments theprocessor initially loads the bit line latch 1682 and the sensecircuitry sets it to an inhibit value during the verify process.

Data latch stack 1694 contains a stack of data latches corresponding tothe sense module. In one embodiment, there are three data latches persense module 1680. In some implementations (but not required), the datalatches are implemented as a shift register so that the parallel datastored therein is converted to serial data for data bus 1620, and viceversa. In the preferred embodiment, all the data latches correspondingto the read/write block of m storage elements can be linked together toform a block shift register so that a block of data can be input oroutput by serial transfer. In particular, the bank of r read/writemodules is adapted so that each of its set of data latches will shiftdata in to or out of the data bus in sequence as if they are part of ashift register for the entire read/write block.

Additional information about the structure and/or operations of variousembodiments of non-volatile storage devices can be found in (1) U.S.Patent Application Pub. No. 2004/0057287, “Non-Volatile Memory AndMethod With Reduced Source Line Bias Errors,” published on Mar. 25,2004; (2) U.S. Patent Application Pub No. 2004/0109357, “Non-VolatileMemory And Method with Improved Sensing,” published on Jun. 10, 2004;(3) U.S. patent application Ser. No. 11/015,199 titled “Improved MemorySensing Circuit And Method For Low Voltage Operation,” filed on Dec. 16,2004; (4) U.S. patent application Ser. No. 11/099,133, titled“Compensating for Coupling During Read Operations of Non-VolatileMemory,” filed on Apr. 5, 2005; and (5) U.S. patent application Ser. No.11/321,953, titled “Reference Sense Amplifier For Non-Volatile Memory,filed on Dec. 28, 2005. All five of the immediately above-listed patentdocuments are incorporated herein by reference in their entirety.

FIG. 19 illustrates an example of an organization of a memory array intoblocks for an all bit line memory architecture or for an odd-even memoryarchitecture. Exemplary structures of memory array 1500 are described.As one example, a NAND flash EEPROM is described that is partitionedinto 1,024 blocks. The data stored in each block can be simultaneouslyerased. In one embodiment, the block is the minimum unit of storageelements that are simultaneously erased. In each block, in this example,there are 8,512 columns corresponding to bit lines BL0, BL1, . . .BL8511. In one embodiment referred to as an all bit line (ABL)architecture (architecture 1910), all the bit lines of a block can besimultaneously selected during read and program operations. Storageelements along a common word line and connected to any bit line can beprogrammed at the same time.

In the example provided, 64 storage elements and two dummy storageelements are connected in series to form a NAND string. There are sixtyfour data word lines and two dummy word lines, WL-d0 and WL-d1, whereeach NAND string includes sixty four data storage elements and two dummystorage elements. In other embodiments, the NAND strings can have moreor less than 64 data storage elements and two dummy storage elements.Data memory cells can store user or system data. Dummy memory cells aretypically not used to store user or system data.

One terminal of the NAND string is connected to a corresponding bit linevia a drain select gate (connected to select gate drain lines SGD), andanother terminal is connected to c-source via a source select gate(connected to select gate source line SGS).

In one embodiment, referred to as an odd-even architecture (architecture1900), the bit lines are divided into even bit lines (BLe) and odd bitlines (BLo). In this case, storage elements along a common word line andconnected to the odd bit lines are programmed at one time, while storageelements along a common word line and connected to even bit lines areprogrammed at another time. Data can be programmed into different blocksand read from different blocks concurrently. In each block, in thisexample, there are 8,512 columns that are divided into even columns andodd columns.

During one configuration of read and programming operations, 4,256storage elements are simultaneously selected. The storage elementsselected have the same word line and the same kind of bit line (e.g.,even or odd). Therefore, 532 bytes of data, which form a logical page,can be read or programmed simultaneously, and one block of the memorycan store at least eight logical pages (four word lines, each with oddand even pages). For multi-state storage elements, when each storageelement stores two bits of data, where each of these two bits are storedin a different page, one block stores sixteen logical pages. Other sizedblocks and pages can also be used.

For either the ABL or the odd-even architecture, storage elements can beerased by raising the p-well to an erase voltage (e.g., 20 V) andgrounding the word lines of a selected block. The source and bit linesare floating. Erasing can be performed on the entire memory array,separate blocks, or another unit of the storage elements which is aportion of the memory device. Electrons are transferred from thefloating gates of the storage elements to the p-well region so that theV_(TH) of the storage elements becomes negative.

In the read and verify operations, the select gates (SGD and SGS) areconnected to a voltage in a range of 2.5 to 4.5 V and the unselectedword lines (e.g., WL0, WL1 and WL3, when WL2 is the selected word line)are raised to a read pass voltage, V_(READ), (typically a voltage in therange of 4.5 to 6 V) to make the transistors operate as pass gates. Theselected word line WL2 is connected to a voltage, a level of which isspecified for each read and verify operation in order to determinewhether a V_(TH) of the concerned storage element is above or below suchlevel. For example, in a read operation for a two-level storage element,the selected word line WL2 may be grounded, so that it is detectedwhether the V_(TH) is higher than 0 V. In a verify operation for a twolevel storage element, the selected word line WL2 is connected to 0.8 V,for example, so that it is verified whether or not the V_(TH) hasreached at least 0.8 V. The source and p-well are at 0 V. The selectedbit lines, assumed to be the even bit lines (BLe), are pre-charged to alevel of, for example, 0.7 V. If the V_(TH) is higher than the read orverify level on the word line, the potential level of the bit line (BLe)associated with the storage element of interest maintains the high levelbecause of the non-conductive storage element. On the other hand, if theV_(TH) is lower than the read or verify level, the potential level ofthe concerned bit line (BLe) decreases to a low level, for example, lessthan 0.5 V, because the conductive storage element discharges the bitline. The state of the storage element can thereby be detected by avoltage comparator sense amplifier that is connected to the bit line.

The erase, read and verify operations described above are performedaccording to techniques known in the art. Thus, many of the detailsexplained can be varied by one skilled in the art. Other erase, read andverify techniques known in the art can also be used.

FIG. 20 depicts an example set of threshold voltage distributions.Example V_(TH) distributions for the storage element array are providedfor a case where each storage element stores two bits of data. A firstthreshold voltage distribution E is provided for erased storageelements. Three threshold voltage distributions, A, B and C forprogrammed storage elements, are also depicted. In one embodiment, thethreshold voltages in the E distribution are negative and the thresholdvoltages in the A, B and C distributions are positive. Note that thissimplified example refers to four states. However, additional states,e.g., 16, 32, 64 or more can be used.

Each distinct threshold voltage range corresponds to predeterminedvalues for the set of data bits. The specific relationship between thedata programmed into the storage element and the threshold voltagelevels of the storage element depends upon the data encoding schemeadopted for the storage elements. For example, U.S. Pat. No. 6,222,762and U.S. Patent Application Publication No. 2004/0255090, published Dec.16, 2004, both of which are incorporated herein by reference in theirentirety, describe various data encoding schemes for multi-state flashstorage elements. In one embodiment, data values are assigned to thethreshold voltage ranges using a Gray code assignment so that if thethreshold voltage of a floating gate erroneously shifts to itsneighboring physical state, only one bit will be affected. One exampleassigns “11” to threshold voltage range E (state E), “10” to thresholdvoltage range A (state A), “00” to threshold voltage range B (state B)and “01” to threshold voltage range C (state C). However, in otherembodiments, Gray code is not used. Although four states are shown, thepresent invention can also be used with other multi-state structuresincluding those that include more or less than four states.

Three read reference voltages, Vra, Vrb and Vrc, are also provided forreading data from storage elements. By testing whether the thresholdvoltage of a given storage element is above or below Vra, Vrb and Vrc,the system can determine the state, e.g., programming condition, thestorage element is in. With four states, three read reference voltagesor compare points are used. With sixteen states, fifteen compare pointsare used, and so on. These are compare points of the type referred to atsteps 1102, 1110, 1132, 1150, 1172, 1192, 1478 and 1486, discussedpreviously.

Further, three verify reference voltages, Vva, Vvb and Vvc, areprovided. Additional verify points can be used for storage elementswhich store data indicative of additional programming states. Whenprogramming storage elements to state A, the system will test whetherthose storage elements have a threshold voltage greater than or equal toVva. When programming storage elements to state B, the system will testwhether the storage elements have threshold voltages greater than orequal to Vvb. When programming storage elements to state C, the systemwill determine whether storage elements have their threshold voltagegreater than or equal to Vvc.

In one embodiment, known as full sequence programming, storage elementscan be programmed from the erase state E directly to any of theprogrammed states A, B or C. For example, a population of storageelements to be programmed may first be erased so that all storageelements in the population are in erased state E. A series ofprogramming pulses such as depicted by the control gate voltage sequenceof FIG. 24 will then be used to program storage elements directly intostates A, B or C. While some storage elements are being programmed fromstate E to state A, other storage elements are being programmed fromstate E to state B and/or from state E to state C. When programming fromstate E to state C on WLn, the amount of parasitic coupling to theadjacent floating gate under WLn−1 is a maximized since the change inamount of charge on the floating gate under WLn is largest as comparedto the change in voltage when programming from state E to state A orstate E to state B. When programming from state E to state B the amountof coupling to the adjacent floating gate is reduced but stillsignificant. When programming from state E to state A the amount ofcoupling is reduced even further. Consequently the amount of correctionrequired to subsequently read each state of WLn−1 will vary depending onthe state of the adjacent storage element on WLn. The process shown canbe extended to additional states as well.

FIG. 21 illustrates an example of a two-pass technique of programming amulti-state storage element that stores data for two different pages: alower page and an upper page. Four states are depicted: state E (11),state A (10), state B (00) and state C (01). The process shown can beextended to additional states as well. For state E, both pages store a“1.” For state A, the lower page stores a “0” and the upper page storesa “1.” For state B, both pages store “0.” For state C, the lower pagestores “1” and the upper page stores “0.” Note that although specificbit patterns have been assigned to each of the states, different bitpatterns may also be assigned.

In a first programming pass, the storage element's threshold voltagelevel is set according to the bit to be programmed into the lowerlogical page. If that bit is a logic “1,” the threshold voltage is notchanged since it is in the appropriate state as a result of having beenearlier erased. However, if the bit to be programmed is a logic “0,” thethreshold level of the storage element is increased to be state A, asshown by arrow 2100. That concludes the first programming pass.

In a second programming pass, the storage element's threshold voltagelevel is set according to the bit being programmed into the upperlogical page. If the upper logical page bit is to store a logic “1,”then no programming occurs since the storage element is in one of thestates E or A, depending upon the programming of the lower page bit,both of which carry an upper page bit of “1.” If the upper page bit isto be a logic “0,” then the threshold voltage is shifted. If the firstpass resulted in the storage element remaining in the erased state E,then in the second phase the storage element is programmed so that thethreshold voltage is increased to be within state C, as depicted byarrow 2120. If the storage element had been programmed into state A as aresult of the first programming pass, then the storage element isfurther programmed in the second pass so that the threshold voltage isincreased to be within state B, as depicted by arrow 2110. The result ofthe second pass is to program the storage element into the statedesignated to store a logic “0” for the upper page without changing thedata for the lower page. In both FIG. 20 and FIG. 21, the amount ofcoupling to the floating gate on the adjacent word line depends on thefinal state.

In one embodiment, a system can be set up to perform full sequencewriting if enough data is written to fill up an entire page. If notenough data is written for a full page, then the programming process canprogram the lower page programming with the data received. Whensubsequent data is received, the system will then program the upperpage. In yet another embodiment, the system can start writing in themode that programs the lower page and convert to full sequenceprogramming mode if enough data is subsequently received to fill up anentire (or most of a) word line's storage elements. More details of suchan embodiment are disclosed in U.S. Patent Application Pub. No.2006/0126390, titled “Pipelined Programming of Non-Volatile MemoriesUsing Early Data,” published Jun. 15, 2006, incorporated herein byreference in its entirety.

FIGS. 22 a-c disclose another process for programming non-volatilememory that reduces the effect of floating gate to floating gatecoupling by, for any particular storage element, writing to thatparticular storage element with respect to a particular page subsequentto writing to adjacent storage elements for previous pages. In oneexample implementation, the non-volatile storage elements store two bitsof data per storage element, using four data states. The process showncan be extended to additional states as well. For example, assume thatstate E is the erased state and states A, B and C are the programmedstates. State E stores data 11. State A stores data 01. State B storesdata 10. State C stores data 00. This is an example of non-Gray codingbecause both bits change between adjacent states A and B. Otherencodings of data to physical data states can also be used. Each storageelement stores two pages of data. For reference purposes, these pages ofdata will be called upper page and lower page; however, they can begiven other labels. With reference to state A, the upper page stores bit0 and the lower page stores bit 1. With reference to state B, the upperpage stores bit 1 and the lower page stores bit 0. With reference tostate C, both pages store bit data 0.

The programming process is a two-step process. In the first step, thelower page is programmed. If the lower page is to remain data 1, thenthe storage element state remains at state E. If the data is to beprogrammed to 0, then the threshold of voltage of the storage element israised such that the storage element is programmed to state B′. FIG. 22a therefore shows the programming of storage elements from state E tostate B′. State B′ is an interim state B; therefore, the verify point isdepicted as Vvb′, which is lower than Vvb.

In one embodiment, after a storage element is programmed from state E tostate B′, its neighbor storage element (WLn+1) in the NAND string willthen be programmed with respect to its lower page. For example,referring to FIG. 2, after the lower page for storage element 106 isprogrammed, the lower page for storage element 104 would be programmed.After programming storage element 104, the floating gate to floatinggate coupling effect will raise the apparent threshold voltage ofstorage element 106 if storage element 104 had a threshold voltageraised from state E to state B′. This will have the effect of wideningthe threshold voltage distribution for state B′ to that depicted asthreshold voltage distribution 2250 of FIG. 22 b. This apparent wideningof the threshold voltage distribution will be remedied when programmingthe upper page.

FIG. 22 c depicts the process of programming the upper page. If thestorage element is in erased state E and the upper page is to remain at1, then the storage element will remain in state E. If the storageelement is in state E and its upper page data is to be programmed to 0,then the threshold voltage of the storage element will be raised so thatthe storage element is in state A. If the storage element was inintermediate threshold voltage distribution 2250 and the upper page datais to remain at 1, then the storage element will be programmed to finalstate B. If the storage element is in intermediate threshold voltagedistribution 2250 and the upper page data is to become data 0, then thethreshold voltage of the storage element will be raised so that thestorage element is in state C. The process depicted by FIGS. 22 a-creduces the effect of floating gate to floating gate coupling becauseonly the upper page programming of neighbor storage elements will havean effect on the apparent threshold voltage of a given storage element.An example of an alternate state coding is to move from distribution2250 to state C when the upper page data is a 1, and to move to state Bwhen the upper page data is a 0.

Although FIGS. 22 a-c provide an example with respect to four datastates and two pages of data, the concepts taught can be applied toother implementations with more or fewer than four states and differentthan two pages.

FIG. 23 is a flow chart describing one embodiment of a method forprogramming non-volatile memory. In one implementation, storage elementsare erased (in blocks or other units) prior to programming. In step2300, a “data load” command is issued by the controller and inputreceived by control circuitry 1610. In step 2305, address datadesignating the page address is input to decoder 1614 from thecontroller or host. In step 2310, a page of program data for theaddressed page is input to a data buffer for programming. That data islatched in the appropriate set of latches. In step 2315, a “program”command is issued by the controller to state machine 1612.

Triggered by the “program” command, the data latched in step 2310 willbe programmed into the selected storage elements controlled by statemachine 1612 using the stepped program pulses of the pulse train 2400 ofFIG. 24 applied to the appropriate selected word line. In step 2320, theprogram voltage, V_(PGM), is initialized to the starting pulse (e.g., 12V or other value) and a program counter (PC) maintained by state machine1612 is initialized at zero. In step 2330, the first V_(PGM) pulse isapplied to the selected word line to begin programming storage elementsassociated with the selected word line. If logic “0” is stored in aparticular data latch indicating that the corresponding storage elementshould be programmed, then the corresponding bit line is grounded. Onthe other hand, if logic “1” is stored in the particular latchindicating that the corresponding storage element should remain in itscurrent data state, then the corresponding bit line is connected toV_(dd) to inhibit programming.

In step 2335, the states of the selected storage elements are verified.If it is detected that the target threshold voltage of a selectedstorage element has reached the appropriate level, then the data storedin the corresponding data latch is changed to a logic “1.” If it isdetected that the threshold voltage has not reached the appropriatelevel, the data stored in the corresponding data latch is not changed.In this manner, a bit line having a logic “1” stored in itscorresponding data latch does not need to be programmed. When all of thedata latches are storing logic “1,” the state machine (via the wired-ORtype mechanism described above) knows that all selected storage elementshave been programmed. In decision step 2340, a check is made as towhether all of the data latches are storing logic “1.” If all of thedata latches are storing logic “1,” the programming process is completeand successful because all selected storage elements were programmed andverified. A status of “PASS” is reported in step 2345.

If, in step 2340, it is determined that not all of the data latches arestoring logic “1,” then the programming process continues. In decisionstep 2350, the program counter PC is checked against a program limitvalue PCmax. One example of a program limit value is twenty; however,other numbers can also be used. If the program counter PC is not lessthan PCmax, then the program process has failed and a status of “FAIL”is reported in step 2355. If the program counter PC is less than PCmax,then V_(PGM) is increased by the step size and the program counter PC isincremented in step 2360. The process then loops back to step 2330 toapply the next V_(PGM) pulse.

FIG. 24 depicts an example pulse train 2400 applied to the control gatesof non-volatile storage elements during programming. The pulse train2400 includes a series of program pulses 2405, 2410, 2415, 2420, 2425,2430, 2435, 2440, 2445, 2450, . . . , that are applied to a word lineselected for programming. In one embodiment, the programming pulses havea voltage, V_(PGM), which starts at 12 V and increases by increments,e.g., 0.5 V, for each successive programming pulse until a maximum of 20V is reached. In between the program pulses are verify pulses. Forexample, verify pulse set 2406 includes three verify pulses. In someembodiments, there can be a verify pulse for each state that data isbeing programmed into, e.g., state A, B and C. In other embodiments,there can be more or fewer verify pulses. The verify pulses in each setcan have amplitudes of Vva, Vvb and Vvc (FIG. 21) or Vvb′ (FIG. 22 a),for instance.

As mentioned, the voltages which are applied to word lines to implementa boost mode are applied when programming occurs, e.g., prior to andduring a program pulse. In practice, the boost voltages of a boost modecan be initiated slightly before each program pulse and removed aftereach program pulse. On the other hand, during the verify process, forinstance, which occurs between program pulses, the boost voltages arenot applied. Instead, read voltages, which are typically less than theboost voltages, are applied to the unselected word lines. The readvoltages have an amplitude which is sufficient to maintain thepreviously programmed storage elements in a NAND string on when thethreshold voltage of a currently-programmed storage element is beingcompared to a verify level.

The foregoing detailed description of the invention has been presentedfor purposes of illustration and description. It is not intended to beexhaustive or to limit the invention to the precise form disclosed. Manymodifications and variations are possible in light of the aboveteaching. The described embodiments were chosen in order to best explainthe principles of the invention and its practical application, tothereby enable others skilled in the art to best utilize the inventionin various embodiments and with various modifications as are suited tothe particular use contemplated. It is intended that the scope of theinvention be defined by the claims appended hereto.

1. A method for operating non-volatile storage, comprising: retrievingstored data from a set of non-volatile storage elements which is incommunication with a selected word line in a memory array by: applyingat least a first pre-conditioning waveform to at least one non-volatilestorage element in the set of non-volatile storage elements inassociation with a first read operation; performing the first readoperation on the set of non-volatile storage elements; performing asecond read operation on the set of non-volatile storage elements; anddetermining programmed states of non-volatile storage elements in theset of non-volatile storage elements responsive to the first and secondread operations.
 2. The method of claim 1, wherein: the performing thefirst read operation comprises applying a series of read referencevoltages to the selected word line, the read reference voltages aredifferent than verify voltages; and the at least a firstpre-conditioning waveform is applied a specified time before at leastone of the read reference voltages in the series.
 3. The method of claim1, wherein: the performing the first read operation comprises applying aseries of read reference voltages to the selected word line, the readreference voltages are different than verify voltages; and at least onepre-conditioning waveform is applied a specified time before each readreference voltage in the series.
 4. The method of claim 1, wherein: theat least a first pre-conditioning waveform is applied to at least one ofa control gate, source and drain of the at least one non-volatilestorage element.
 5. The method of claim 1, wherein: the at least a firstpre-conditioning waveform is applied to the at least one non-volatilestorage element via a p-well of a substrate on which the set ofnon-volatile storage elements is formed.
 6. The method of claim 1,wherein: the at least a first pre-conditioning waveform precedes thefirst read operation; and the second read operation is performed withouta preceding associated pre-conditioning waveform.
 7. The method of claim1, wherein: the first and second read operations each determine whethervoltage thresholds of the non-volatile storage elements in the set ofnon-volatile storage elements are above or below each of a plurality ofdifferent voltage threshold levels.
 8. The method of claim 1, wherein:the determining the programmed states comprises providing code wordsbased on the first read operation and performing an iterativeprobabilistic decoding process for decoding the code words, theiterative probabilistic decoding process is responsive to the secondread operation.
 9. The method of claim 8, further comprising: providinginitial reliability metrics comprising logarithmic likelihood ratios foruse in the iterative probabilistic decoding process based on programmingstates determined by the first read operation; and adjusting theiterative probabilistic decoding process subsequently based onprogramming states determined by the second read operation.
 10. Themethod of claim 8, further comprising: providing initial reliabilitymetrics comprising logarithmic likelihood ratios for use in theiterative probabilistic decoding process based on programming statesdetermined by the first read operation and based on programming statesdetermined by the second read operation.
 11. A method for operatingnon-volatile storage, comprising: performing a plurality of senseoperations on a set of non-volatile storage elements for each of aplurality of programming states, at least one pre-conditioning waveformis provided for at least one of the sense operations; providing a set ofreliability metrics comprising logarithmic likelihood ratios based onthe sense operations; and storing the set of reliability metrics for useby an iterative probabilistic decoding process in determining aprogramming state of at least one non-volatile storage element in theset of non-volatile storage elements in at least one subsequent senseoperation.
 12. The method of claim 11, wherein: the at least one of thesense operations is preceded by the at least one pre-conditioningwaveform; and at least one other of the sense operations is not precededby an associated pre-conditioning waveform.
 13. The method of claim 11,wherein: at least one other of the sense operations is preceded by atleast one associated pre-conditioning waveform; and the at least onepre-conditioning waveform provided for the at least one sense operationis applied to the at least one non-volatile storage element via a p-wellof a substrate on which the at least one non-volatile storage element isformed, and the at least one pre-conditioning waveform associated withthe at least one other of the sense operations is applied to at leastone of a control gate, a source and a drain of the at least onenon-volatile storage element.
 14. The method of claim 11, wherein: atleast one other of the sense operations is preceded by at least oneassociated pre-conditioning waveform; and the at least onepre-conditioning waveform provided for the at least one sense operationis applied to one terminal of the at least one non-volatile storageelement, and the at least one pre-conditioning waveform associated withthe at least one other of the sense operations is applied to anotherterminal of the at least one non-volatile storage element in the set ofnon-volatile storage elements.
 15. A method for operating non-volatilestorage, comprising: applying at least a first pre-conditioning waveformto at least one non-volatile storage element in association with a firstread operation for retrieving stored data from the at least onenon-volatile storage element, the at least a first pre-conditioningwaveform has an effect on the at least one non-volatile storage elementwhich replicates, at least in part, an effect that a program pulse hason the at least one non-volatile storage element prior to a verifyoperation, where the effect on the at least one non-volatile storageelement comprises an increase in noisy behavior of the at least onenon-volatile storage element; performing the first read operation on theat least one non-volatile storage element; and determining a programmingstate of the at least one non-volatile storage element responsive to thefirst read operation using iterative probabilistic decoding.
 16. Themethod of claim 15, wherein: the determining the programming statecomprises providing a code word based on the first read operation andperforming the iterative probabilistic decoding to decode the code word.17. A non-volatile storage system, comprising: a set of non-volatilestorage elements in communication with a selected word line in a memoryarray; and one or more control circuits in communication with the set ofnon-volatile storage elements, the one or more control circuits, toretrieve stored data from the set of non-volatile storage elements:apply at least a first pre-conditioning waveform to at least onenon-volatile storage element in the set of non-volatile storage elementsin association with a first read operation, perform the first readoperation on the at least one non-volatile storage element, perform asecond read operation on the at least one non-volatile storage element,and determine programmed states of non-volatile storage elements in theset of non-volatile storage elements responsive to the first and secondread operations.
 18. The non-volatile storage system of claim 17,wherein: the one or more control circuits perform the first readoperation by applying a series of read reference voltages to theselected word line, the read reference voltages are different thanverify voltages; and the at least a first pre-conditioning waveform isapplied a specified time before at least one of the read referencevoltages in the series.
 19. The non-volatile storage system of claim 17,wherein: the one or more control circuits perform the first readoperation by applying a series of read reference voltages to theselected word line, the read reference voltages are different thanverify voltages; and at least one pre-conditioning waveform is applied aspecified time before each read reference voltage in the series.
 20. Thenon-volatile storage system of claim 17, wherein: the at least a firstpre-conditioning waveform is applied to at least one of a control gate,source and drain of the at least one non-volatile storage element. 21.The non-volatile storage system of claim 17, wherein: the at least afirst pre-conditioning waveform is applied to the at least onenon-volatile storage element via a p-well of a substrate on which theset of non-volatile storage elements is formed.
 22. The non-volatilestorage system of claim 17, wherein: the at least a firstpre-conditioning waveform precedes the first read operation; and thesecond read operation is performed without a preceding associatedpre-conditioning waveform.
 23. The non-volatile storage system of claim17, wherein: the first and second read operations each determine whethervoltage thresholds of the non-volatile storage elements in the set ofnon-volatile storage elements are above or below each of a plurality ofdifferent voltage threshold levels.
 24. The non-volatile storage systemof claim 17, wherein: the one or more control circuits determine theprogramming states by providing code words based on the first readoperation and performing an iterative probabilistic decoding process fordecoding the code words, the iterative probabilistic decoding process isresponsive to the second read operation.
 25. The non-volatile storagesystem of claim 24, wherein: the one or more control circuits provideinitial reliability metrics comprising logarithmic likelihood ratios foruse in the iterative probabilistic decoding process based on programmingstates determined by the first read operation, and adjust the iterativeprobabilistic decoding process subsequently based on programming statesdetermined by the second read operation.
 26. The non-volatile storagesystem of claim 24, wherein: the one or more control circuits provideinitial reliability metrics comprising logarithmic likelihood ratios foruse in the iterative probabilistic decoding process based on programmingstates determined by the first read operation and based on programmingstates determined by the second read operation.
 27. The non-volatilestorage system of claim 17, wherein: the at least a firstpre-conditioning waveform is applied to the at least one non-volatilestorage element via a p-well of a substrate on which the set ofnon-volatile storage elements is formed.
 28. A non-volatile storagesystem, comprising: a set of non-volatile storage elements; and one ormore control circuits in communication with the set of non-volatilestorage elements, the one or more control circuits perform a pluralityof sense operations on the set of non-volatile storage elements for eachof a plurality of programming states, at least one pre-conditioningwaveform is provided for at least one of the sense operations, provide aset of reliability metrics comprising logarithmic likelihood ratiosbased on the sense operations, and store the set of reliability metricsfor use by an iterative probabilistic decoding process in determining aprogramming state of at least one non-volatile storage element in theset of non-volatile storage elements in at least one subsequent senseoperation.
 29. The non-volatile storage system of claim 28, wherein: theat least one of the sense operations is preceded by the at least onepre-conditioning waveform; and at least one other of the senseoperations is not preceded by an associated pre-conditioning waveform.30. The non-volatile storage system of claim 28, wherein: at least oneother of the sense operations is preceded by at least one associatedpre-conditioning waveform; and the at least one pre-conditioningwaveform provided for the at least one sense operation is applied to theat least one non-volatile storage element via a p-well of a substrate onwhich the at least one non-volatile storage element is formed, and theat least one pre-conditioning waveform associated with the at least oneother of the sense operations is applied to at least one of a controlgate, a source and a drain of the at least one non-volatile storageelement.
 31. The non-volatile storage system of claim 28, wherein: eachof the sense operations comprises a read operation which reads back aprogrammed state which has been programmed into at least onenon-volatile storage element in the set of non-volatile storageelements, and the read operation makes a determination as to whether avoltage threshold of the at least one non-volatile storage element isabove or below each of a plurality of different voltage thresholdlevels.
 32. The non-volatile storage system of claim 28, wherein: atleast one other of the sense operations is preceded by at least oneassociated pre-conditioning waveform; and the at least onepre-conditioning waveform provided for the at least one sense operationis applied to one terminal of the at least one non-volatile storageelement, and the at least one pre-conditioning waveform associated withthe at least one other of the sense operations is applied to anotherterminal of the at least one non-volatile storage element.
 33. Thenon-volatile storage system of claim 28, wherein: the one or morecontrol circuits obtain voltage threshold profiles of the at least onenon-volatile storage element for each of the plurality of programmingstates, the set of reliability metrics is based on the voltage thresholdprofiles.
 34. The non-volatile storage system of claim 28, wherein: atleast one other of the sense operations is preceded by at least oneassociated pre-conditioning waveform which differs from the at least onepre-conditioning waveform provided for the at least one sense operationin at least one characteristic.
 35. A non-volatile storage system,comprising: a set of non-volatile storage elements; and one or morecontrol circuits in communication with the set of non-volatile storageelements, the one or more control circuits: (a) apply at least a firstpre-conditioning waveform to at least one non-volatile storage elementin the set of non-volatile storage elements in association with a firstread operation to retrieve stored data from the at least onenon-volatile storage element, the at least a first pre-conditioningwaveform has an effect on the at least one non-volatile storage elementwhich replicates, at least in part, an effect that a program pulse hason the at least one non-volatile storage element prior to a verifyoperation, (b) perform the first read operation on the at least onenon-volatile storage element, and (c) determine a programming state ofthe at least one non-volatile storage element responsive to the firstread operation using iterative probabilistic decoding, where the one ormore control circuits access initial reliability metrics comprisinglogarithmic likelihood ratios for use in the iterative probabilisticdecoding, the initial reliability metrics are developed by applyingpre-conditioning waveforms in association with read operations to theset of non-volatile storage elements.
 36. The non-volatile storagesystem of claim 35, wherein: the one or more control circuits determinethe programming state by providing a code word based on the first readoperation and performing the iterative probabilistic decoding to decodethe code word.
 37. A non-volatile storage system, comprising: a set ofnon-volatile storage elements; and one or more control circuits incommunication with the set of non-volatile storage elements, the one ormore control circuits: (a) apply at least a first pre-conditioningwaveform to at least one non-volatile storage element in the set ofnon-volatile storage elements in association with a first read operationto retrieve stored data from the at least one non-volatile storageelement, the at least a first pre-conditioning waveform has an effect onthe at least one non-volatile storage element which replicates, at leastin part, an effect that a program pulse has on the at least onenon-volatile storage element prior to a verify operation, the effect onthe at least one non-volatile storage element comprises an increase innoisy behavior of the at least one non-volatile storage element, (b)perform the first read operation on the at least one non-volatilestorage element, and (c) determine a programming state of the at leastone non-volatile storage element responsive to the first read operationusing iterative probabilistic decoding.